IMMUNOREACTIVE HEPATITIS C VIRUS POLYPEPTIDE COMPOSITIONS

Technical Field 

This invention relates generally to
immunoreactive polypeptide compositions, methods of using
the compositions in immunological applications, and
materials and methods for making the compositions.

Background

The hepatitis C virus (HCV) has been recently
identified as the major causative agent of post-
transfusion Non-A, Non-B hepatitis (NANBH), as well as a
significant cause of community-acquired NANBH.
Materials and methods for obtaining the viral genomic
sequences are known. See, e.g., PCT Publication Nos.
WO89/04669, WO90/11089 & WO90/14436.

Molecular characterization of the HCV genome
indicates that it is an RNA molecule of positive polarity
containing approximately 10,000 nucleotides that encodes
a polyprotein of about 3011 amino acids. Several lines
of evidence suggest that HCV has a similar genetic
organization to the viruses of the family Flaviviridae,
which includes the flavi- and pestiviruses. Like its
pesti- and flaviviral relatives, HCV appears to encode a
large polyprotein precursor from which individual viral
proteins (both structural and non-structural) are
processed.

RNA-containing viruses can have relatively
high rates of spontaneous mutation, i.e., reportedly on
the order of 10^-3 to 10^-4 per incorporated nucleotide.
Therefore, since heterogeneity and fluidity of genotype
are common in RNA viruses, there may be multiple viral
isolates, which may be virulent or avirulent, within the
HCV species.

A number of different isolates of HCV have now
been identified. The sequences of these isolates
demonstrate the limited heterogeneity characteristic of
RNA viruses.

Isolate HCV J1.1 is described in Kubo, Y. et
al. (1989), Japan. Nucl. Acids Res. 17:10367-10372;
Takeuchi, K. et al. (1990), Gene 91:267-291; Takeuchi et
al. (1990), J. Gen. Virol. 71:3027-3033; Takeuchi et al.
(1990), Nucl. Acids Res. 18:4626.

The complete coding sequences plus the 5'- and
3'-terminal sequences of two independent isolates,
"HCV-J" and "BK", are described by Kato et al. and
Takamizawa et al., respectively. (Kato et al. (1990),
Proc. Natl. Acad. Sci. USA 87:9524-9528; Takamizawa et al.
(1991), J. Virol. 65:1105-1113.)

Other publications describing HCV isolates are
the following:

"HCV-1": Choo et al. (1990), Brit. Med.
Bull. 46:423-441; Choo et al. (1991), Proc.
Natl. Acad. Sci. USA 88:2451-2455; Han et al.
(1991), Proc. Natl. Acad. Sci. USA 88:1711-
1715; European Patent Publication No. 318,216.

"HC-J1" and "HC-J4": Okamoto et al.
(1991), Japan J. Exp. Med. 60:167-177.

"HCT 18", "HCT 23", "Th", "HCT 27", "EC1"
and "EC10": Weiner et al. (1991), Virol.
180:842-848.

"Pt-1", "HCV-K1" and "HCV-K2": Enomoto et
al., There are two major types of hepatitis C
virus in Japan. Division of Gastroenterology,
Department of Internal Medicine, Kanazawa
Medical University, Japan.

Clones "A", "C", "D" & "E": Tsukiyama-
Kohara et al., A second group of hepatitis
virus, in Virus Genes.



A typical approach to diagnostic and vaccine
strategy is to focus on conserved viral domains. This
approach, however, suffers from the disadvantage of
ignoring important epitopes that may lie in variable
domains.

It is an object of this invention to provide
polypeptide compositions that are immunologically cross-
reactive with multiple HCV isolates, particularly with
respect to heterogeneous domains of the virus.

Summary of the Invention

It has been discovered that a number of
important HCV epitopes vary among viral isolates, and
that these epitopes can be mapped to particular domains.
This discovery allows for a strategy of producing
immunologically cross-reactive polypeptide compositions
that focuses on variable (rather than conserved) domains.

Accordingly, one embodiment of the present
invention is an immunoreactive composition comprising
polypeptides wherein the polypeptides comprise the amino
acid sequence of an epitope within a first variable
domain of HCV, and at least two heterogeneous amino acid
sequences from the first variable domain of distinct HCV
isolates are present in the composition.

Another embodiment of the invention is an
immunoreactive composition comprising a plurality of
antigen sets, wherein (a) each antigen set consists of a
plurality of substantially identical polypeptides
comprising the amino acid sequence of an epitope within a
first variable domain of an HCV isolate, and (b) the
amino acid sequence of the epitope of one set is
heterogeneous with respect to the amino acid sequence of
the analogous sequence of at least one other set.

Another embodiment of the invention is
an immunoreactive composition comprising a plurality of
polypeptides wherein each polypeptide has the formula

$R_r-(SV_n)_x-R'_{r'}$

wherein

R and R' are amino acid sequences of about
1-2000 amino acids, and are the same or different;

r and r' are 0 or 1, and are the same or
different;

V is an amino acid sequence comprising the
sequence of an HCV variable domain, wherein the variable
domain comprises at least one epitope;

S is an integer ≥ 1, representing a selected
variable domain; and

n is an integer ≥ 1, representing a selected
HCV isolate heterogeneous at a given SV with respect to
at least one other isolate having a different value for
n, and n being independently selected for each x;

x is an integer ≥ 1; and

with the proviso that amino acid sequences are present in
the composition representing a combination selected from
the group consisting of (i) $1V_1$ and $1V_2$, (ii) $1V_1$ and $2V_2$,
and (iii) $1V_1$ and $2V_1$.

Yet another embodiment of the invention is a
method for preparing an immunogenic pharmaceutical
HCV composition comprising:

(a) providing an immunoreactive composition as
described above;

(b) providing a suitable excipient; and

(c) mixing the immunoreactive composition of
(a) with the excipient of (b) in a proportion that
provides an immunogenic response upon administration to a
mammal.

Still another embodiment of the invention is a
method for producing anti-HCV antibodies comprising
administering to a mammal an effective amount of an
immunoreactive composition as described above.

Yet another embodiment of the invention is a
method of detecting antibodies to HCV within a biological
sample comprising:

(a) providing a biological sample suspected of
containing antibodies to HCV;

(b) providing an immunoreactive composition
described above;

(c) reacting the biological sample of (a) with
the immunoreactive composition of (b) under conditions
which allow the formation of antigen-antibody complexes;
and

(d) detecting the formation of antigen-
antibody complexes formed between the immunoreactive
composition of (b) and the antibodies of the biological
sample of (a), if any.

Another embodiment of the invention is a kit
for detecting antibodies to HCV within a biological
sample comprising an immunoreactive composition as
described above packaged in a suitable container.

Brief Description of the Figures

Figure 1 schematically shows the genetic
organization of the HCV genome.

Figure 2 shows a comparison of the deduced
amino acid sequences of the E1 protein encoded by group I
and group II HCV isolates.

Figure 3 shows a comparison of the amino acid
sequences of the putative E2/NS1 region of HCV isolates.

Figure 4 presents graphs showing the antigenicity
profiles for the amino-terminal region of the putative
HCV E2/NS1 protein (amino acids 384-420), and the gp120
V3 hypervariable region of HIV-1.

Figure 5 shows a series of graphs which give
the percentage probabilities that a given residue from
the amino-terminal region of HCV E2/NS1 protein (amino
acids 384 to 420) will be found in either an alpha-helix,
beta-sheet or beta-turn secondary structural motif.

Figure 6 presents bar graphs showing the reactivity
of antibodies in the plasma from HCT 18 (panels A-C) or
Th (panels D-F) with overlapping biotinylated 8mer
peptides derived from amino acids 384 to 415 or 416 of
HCV isolates HCT 18 (A,D), Th (B,E) and HCV J1 (C,F),
respectively.

Figure 7 shows the deduced amino acid sequences
of two regions of the E2/NS1 polypeptide, amino acids
384-414 and 547-647, given for the Q1 and Q3 isolates.

Figure 8A shows the deduced amino acid
sequences of isolates HCV J1.1 and J1.2 from amino acids
384 to 647. Figure 8B shows the deduced amino acid
sequences of isolates HCT27 and HCVE1 from amino acids
384 to 651.

Figure 9 shows the entire polyprotein sequence
of isolate HCV-1.

Modes of Practicing the Invention

The practice of the present invention will
employ, unless otherwise indicated, conventional
techniques of molecular biology, microbiology,
recombinant DNA, and immunology, which are within the
skill of the art. Such techniques are explained fully in
the literature. See e.g., Maniatis, Fritsch & Sambrook,
MOLECULAR CLONING: A LABORATORY MANUAL (2nd ed. 1989);
DNA CLONING, VOLUMES I AND II (D.N. Glover ed. 1985);
OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed. 1984); NUCLEIC
ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds. 1984);
TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins
eds. 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 1986);
IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); B.
Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (1984);
the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.);
GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (J.H. Miller
and M.P. Calos eds. 1987, Cold Spring Harbor Laboratory),
Methods in Enzymology Vol. 154 and Vol. 155 (Wu and
Grossman, and Wu, eds., respectively), Mayer and Walker,
eds. (1987), IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR
BIOLOGY (Academic Press, London), Scopes, (1987), PROTEIN
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL
IMMUNOLOGY, VOLUMES I-IV (D.M. Weir and C.C. Blackwell eds.
1986); IMMUNOASSAY: A PRACTICAL GUIDE (D.W. Chan ed.
1987). All patents, patent applications, and publica-
tions mentioned herein, both above and below, are
incorporated by reference herein.

HCV is a new member of the family Flaviviridae,
which includes the pestiviruses (Hog Cholera Virus and
Bovine Viral Diarrhea Virus) and the flaviviruses,
examples of which are Dengue and Yellow Fever Virus. A
scheme of the genetic organization of HCV is shown in
Figure 1. Similar to the flavi- and pestiviruses, HCV
appears to encode a basic polypeptide domain ("C") at the
N-terminus of the viral polyprotein followed by two
glycoprotein domains ("E1", "E2/NS1"), upstream of the
nonstructural genes NS2 through NS5. The amino acid
coordinates of the putative protein domains are shown in
Table 1.



Table 1. The Putative Protein Domains in HCV

a.a. coordinates (approximate)    Protein

   1 - 191                        C
 192 - 383                        E1
 384 - 750                        E2/NS1
 751 - 1006                       NS2
1007 - 1488                       NS3
1489 - 1959                       NS4
1960 - 3011                       NS5
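
By way of illustration only, the approximate coordinates of Table 1 can be used to look up the putative domain that contains a given polyprotein residue. The following is a minimal sketch in Python; the names `DOMAINS` and `domain_of` are hypothetical and are not part of the original disclosure, and the boundaries are the approximate values of Table 1 (HCV-1 numbering), not exact cleavage sites.

```python
from bisect import bisect_right

# Approximate domain boundaries copied from Table 1 (HCV-1 numbering,
# 1-based, inclusive). These are putative coordinates, not exact sites.
DOMAINS = [
    (1, 191, "C"),
    (192, 383, "E1"),
    (384, 750, "E2/NS1"),
    (751, 1006, "NS2"),
    (1007, 1488, "NS3"),
    (1489, 1959, "NS4"),
    (1960, 3011, "NS5"),
]
_STARTS = [start for start, _end, _name in DOMAINS]


def domain_of(position):
    """Return the putative domain containing a 1-based polyprotein position."""
    if not 1 <= position <= DOMAINS[-1][1]:
        raise ValueError("position lies outside the 3011-residue polyprotein")
    # bisect_right finds the first domain start greater than the position;
    # the domain just before it is the one that contains the position.
    return DOMAINS[bisect_right(_STARTS, position) - 1][2]
```

For example, `domain_of(400)` falls within the E2/NS1 row of the table, which is where the hypervariable region discussed later in this section is located.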



As discussed above, a number of HCV isolates
have been identified. Comparative sequence analysis of
complete and partial HCV sequences indicates that based
upon homology at the nucleotide and amino acid levels,
HCV isolates can be broadly sub-divided into at least
three basic groups (Table 2). See Houghton et al.
(1991), Hepatology 14:381-388. However, only partial
sequence is available for the isolates in group III.
Therefore, when the sequences of these isolates are more
defined, one or more of these isolates may deserve
separation into a different group, including a potential
fourth group. Table 3 shows the sequence homologies
between individual viral proteins of different HCV
isolates as deduced from their nucleotide sequences. It
can be seen that the proteins of the same virus group
exhibit greater sequence similarity than the same
proteins encoded by different virus groups (Table 3).
One exception to this is the nucleocapsid protein that is
highly conserved among all group I and II viral isolates
sequenced to date. (In Table 3, the symbol N/A signifies
that the sequences were not available for comparison.)
For purposes of the present invention, therefore, group I
isolates can be defined as those isolates having their
viral proteins, particularly E1 and E2/NS1 proteins,
about 90% homologous or more at the amino acid level to
the isolates classified as group I herein. Group II is
defined in an analogous manner. Future groups can
likewise be defined in terms of viral protein homology to
a prototype isolate. Subgroups can also be defined by
homology in limited proteins, such as the E1, E2/NS1 or
NS2 proteins, or by simply higher levels of homology.
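
The group definition above can be sketched as a simple comparison against a prototype sequence. The Python example below is illustrative only and rests on stated assumptions: the two sequences are taken to be already aligned to equal length, percent identity is used as a stand-in for "homology", the 90% figure is treated as a hard cutoff although the text says "about 90%", and the function names `percent_identity` and `same_group` are hypothetical.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percent of aligned columns holding the same residue in both sequences.

    Both inputs must already be aligned to the same length; columns that
    contain a gap character are counted as mismatches.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b and a != gap)
    return 100.0 * matches / len(seq_a)


def same_group(candidate, prototype, threshold=90.0):
    """Assumed rule: same group if identity to the prototype meets the cutoff."""
    return percent_identity(candidate, prototype) >= threshold
```

In practice the comparison would be made protein by protein (for example E1 and E2/NS1 separately), and a subgroup could be sketched by passing a higher `threshold`.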

Table 2. Classification of hepatitis C viral
genome RNA sequences into three basic groups.

HCV I        HCV II       HCV III

HCV-1        HCV-J1.1     Clones A,C,D&E
HC-J1        HC-J4        HCV-K2 (a&b)
HCT 18       HCV-J
HCT 23       BK
Th           HCV-K1
HCT 27
EC1
Pt-1

Table 3. Amino Acid Homologies (%) Between Viral
Proteins Encoded by Different HCV Isolates

HCV Group         C        E1       E2/NS1   NS2      NS3      NS4      NS5

I compared to
  I               98-100   94-100   N/A      N/A      N/A      N/A      99-100
  II              97-98    77-79    78-81    75-77    91-92    90-93    84-88
  III             N/A      N/A      N/A      N/A      86       76-80    71-74

II compared to
  II              98-100   92-100   89-100   93-100   94-100   97-100   95-100
  III             N/A      N/A      N/A      N/A      84       76       74-75

III compared to
  III             N/A      N/A      N/A      N/A      N/A      91-100   89-100

It is noteworthy that the putative viral
envelope proteins encoded by the E1 and E2/NS1 genes show
substantial amino acid sequence variation between groups
I and II. Only NS2 exhibits a greater degree of
heterogeneity, while the C, NS3, NS4 and NS5 proteins all
show greater sequence conservation between groups. The
sequence variation observed in the putative virion
envelope proteins between groups I and II reflects a
characteristic segregation of amino acids between the two
groups. An example of this is shown in Figure 2 where
the sequence of the E1 gene product is compared between
viruses of groups I and II. The E1 amino acid sequences
deduced from nucleotide sequences of HCV groups I and II
are shown. In the figure, the horizontal bars indicate
sequence identity with HCV-1. The asterisks indicate
group-specific segregation of amino acids; the group-
specific residues can be clearly identified. Group I
sequences are HCV-1, HCT18, HCT23, HCT27, and HC-J1.
Group II sequences are HC-J4, HCV-J, HCV J1.1, and BK.
Such group-specific segregation of amino acids is also
present in other gene products including gp72 encoded by
the E2/NS1 gene. Figure 3 shows the comparative amino
acid sequence of the putative E2/NS1 region of HCV
isolates which segregate as group I and group II. The
latter protein also contains an N-terminal hypervariable
region ("HV") of about 30 amino acids that shows large
variation between nearly all isolates. See Weiner et al.
(1991), supra. This region occurs between amino acids
384 to 414, using the amino acid numbering system of
HCV-1.

The putative HCV envelope glycoprotein E2/NS1
may correspond to the gp53 (BVDV)/gp55 (Hog Cholera Virus)
envelope polypeptide of the pestiviruses and the NS1 of
the flaviviruses, both of which confer protective
immunity in hosts vaccinated with these polypeptides.

Striking similarities between the
hypervariable region ("HV") and HIV-1 gp120 V3 domains
with respect to degree of sequence variation, the
predictive effect of amino acid changes on putative
antibody binding in addition to the lack of defined
secondary structure suggest that the HV domain encodes
neutralizing antibodies.

The immunogenicity of the domain is shown by
antibody epitope mapping experiments, described in the
Examples. The results of these studies suggest that in
addition to the three major groups of HCV, HV-specific
sub-groups also exist.

Analysis of biological samples from individuals
with HCV-induced NANBH indicates that individuals may be
carrying two or more HCV variants simultaneously. Two
co-existing HV variants were found in the plasma of one
individual, J1. In addition, partial sequencing of the
gene of an individual with chronic NANBH, who had
intermittent flares of hepatitis, revealed that the
individual, Q, was infected with two HCV variants (Q1 and
Q3). Each variant was associated with only one episode
of the disease. An ELISA using a Q1 or Q3 specific
peptide (amino acids 396-407) showed that Q developed an
antibody response to the Q1 peptide but not the
corresponding Q3 peptide, suggesting that Q's
recrudescence of disease was due to the appearance of an
HV variant. The presence of antibodies to the Q1 peptide
but lack of humoral immune response to the Q3 peptide
during the second episode of disease suggests that
variation in the HV domain may result from the pressure
of immune selection. Amino acids 396-407 appear to be
subject to the greatest selective pressure in the HV
domain. These findings support the thesis that high
levels of chronicity associated with the disease might be
due to an inadequate immunological host response to HCV
infection and/or effective viral mechanisms of
immunological evasion. Moreover, they point to the
E2/NS1 HV region as a genetic region involved in a viral
escape mechanism and/or an inadequate immunological
response mechanism(s).

As discussed above, there are several variant
regions within the HCV genome. One or more of these
regions are most likely involved in a viral escape
mechanism and/or an inadequate immunological response
mechanism. Therefore, it is desirable to include in
compositions for treatment of HCV polypeptides which
would induce an immunogenic response to these variants.

In that the E1 and E2/NS1 regions of the genome
encode putative envelope type polypeptides, these regions
would be of particular interest with respect to
immunogenicity. Thus, these regions are amongst those to
which it would be particularly desirable to induce and/or
increase an immune response to protect an individual
against HCV infection, and to aid in the prevention of
chronic recurrence of the disease in infected
individuals. In addition, these regions would be amongst
those from which it would be desirable to detect HCV
variants which are arising during the course of
infection, as well as super- or co-infection by two or
more variants.

The present invention describes compositions
and methods for treating individuals to prevent HCV
infections, and particularly chronic HCV infections. In
addition, it describes compositions and methods for
detecting the presence of anti-HCV antibodies in
biological samples. This latter method is particularly
useful in identifying [illegible]
response to immunologically distinct HCV epitopes. This
method can also be used to study the evolution of
multiple variants of HCV within an infected individual.
In the discussion of the invention, the following
definitions are applicable.
The term "polypeptide" refers to a polymer of
amino acids and does not refer to a specific length of
the product; thus, peptides, oligopeptides, and proteins
are included within the definition of polypeptide. This
term also does not refer to or exclude post-expression
modifications of the polypeptide, for example,
glycosylations, acetylations, phosphorylations and the
like. Included within the definition are, for example,
polypeptides containing one or more analogues of an amino
acid (including, for example, unnatural amino acids,
etc.), polypeptides with substituted linkages, as well as
other modifications known in the art, both naturally
occurring and non-naturally occurring.

As used herein, A is "substantially isolated"
from B when the weight of A is at least about 70%, more
preferably at least about 80%, and most preferably at
least about 90% of the combined weights of A and B. The
polypeptide compositions of the present invention are
preferably substantially free of human or other primate
tissue (including blood, serum, cell lysate, cell
organelles, cellular proteins, etc.) and cell culture
medium.
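
The weight criterion in the definition of "substantially isolated" is simple arithmetic: the weight of A divided by the combined weight of A and B. A minimal Python sketch follows. It assumes, for illustration only, that the "at least about" thresholds are treated as hard cutoffs of 70%, 80% and 90%; the names `isolation_fraction` and `isolation_tier` and the tier labels are hypothetical.

```python
def isolation_fraction(weight_a, weight_b):
    """Fraction of the combined weight of A and B that is contributed by A."""
    if weight_a < 0 or weight_b < 0:
        raise ValueError("weights must be non-negative")
    total = weight_a + weight_b
    if total <= 0:
        raise ValueError("combined weight must be positive")
    return weight_a / total


def isolation_tier(weight_a, weight_b):
    """Classify A against the 70/80/90% levels, taken here as hard cutoffs."""
    fraction = isolation_fraction(weight_a, weight_b)
    if fraction >= 0.90:
        return "most preferred"
    if fraction >= 0.80:
        return "more preferred"
    if fraction >= 0.70:
        return "substantially isolated"
    return "not substantially isolated"
```

For instance, 75 mg of polypeptide A accompanied by 25 mg of contaminant B gives a fraction of 0.75, which meets the first level but not the two preferred levels. Both weights must be expressed in the same unit.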

A "recombinant polynucleotide" intends a
polynucleotide of genomic, cDNA, semisynthetic, or
synthetic origin which, by virtue of its origin or
manipulation: (1) is not associated with all or a portion
of a polynucleotide with which it is associated in
nature, (2) is linked to a polynucleotide other than that
to which it is linked in nature, or (3) does not occur in
nature.

A "polynucleotide" is a polymeric form of
nucleotides of any length, either ribonucleotides or
deoxyribonucleotides. This term refers only to the
primary structure of the molecule. Thus, this term
includes double- and single-stranded DNA and RNA. It
also includes known types of modifications, for example,
labels which are known in the art, methylation, "caps",
substitution of one or more of the naturally occurring
nucleotides with an analog, internucleotide modifications
such as, for example, those with uncharged linkages
(e.g., phosphorothioates, phosphorodithioates, etc.),
those containing pendant moieties, such as, for example,
proteins (including, e.g., nucleases, toxins,
antibodies, signal peptides, poly-L-lysine, etc.), those
with intercalators (e.g., acridine, psoralen, etc.),
those containing chelators (e.g., metals, radioactive
metals, etc.), those containing alkylators, those with
modified linkages (e.g., alpha anomeric nucleic acids,
etc.), as well as unmodified forms of the polynucleotide.

"Recombinant host cells", "host cells",
"cells", "cell lines", "cell cultures", and other such
terms denoting microorganisms or higher eukaryotic cell
lines cultured as unicellular entities refer to cells
which can be, or have been, used as recipients for a
recombinant vector or other transfer polynucleotide, and
include the progeny of the original cell which has been
transfected. It is understood that the progeny of a
single parental cell may not necessarily be completely
identical in morphology or in genomic or total DNA
complement as the original parent, due to natural,
accidental, or deliberate mutation.

A "replicon" is any genetic element, e.g., a
plasmid, a chromosome, a virus, a cosmid, etc., that
behaves as an autonomous unit of polynucleotide
replication within a cell; i.e., capable of replication
under its own control.

A "vector" is a replicon further comprising
sequences providing replication and/or expression of the
open reading frame.

"Control sequence" refers to polynucleotide 
sequences which are necessary to effect the expression of
coding sequences to which they are ligated. The nature
of such control sequences differs depending upon the host
organism; in prokaryotes, such control sequences
generally include promoters, ribosomal binding sites, and
terminators; in eukaryotes, generally, such control
sequences include promoters, terminators and, in some
instances, enhancers. The term "control sequences" is
intended to include, at a minimum, all components whose
presence is necessary for expression, and may also
include additional components whose presence is
advantageous, for example, leader sequences which govern
secretion.

A "promoter" is a nucleotide sequence which is
comprised of consensus sequences which allow the binding
of RNA polymerase to the DNA template in a manner such
that mRNA production initiates at the normal
transcription initiation site for the adjacent structural
gene.

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship
permitting them to function in their intended manner. A
control sequence "operably linked" to a coding sequence
is ligated in such a way that expression of the coding
sequence is achieved under conditions compatible with the
control sequences.

An "open reading frame" (ORF) is a region of a
polynucleotide sequence which encodes a polypeptide; this
region may represent a portion of a coding sequence or a
total coding sequence.

A "coding sequence" is a polynucleotide
sequence which is transcribed into mRNA and/or translated
into a polypeptide when placed under the control of
appropriate regulatory sequences. The boundaries of the
coding sequence are determined by a translation start
codon at the 5'-terminus and a translation stop codon at
the 3'-terminus. A coding sequence can include but is
not limited to mRNA, DNA (including cDNA), and
recombinant polynucleotide sequences.
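
The boundary rule in the definition of a coding sequence (a start codon at the 5'-terminus, an in-frame stop codon at the 3'-terminus) can be illustrated with a short Python sketch. It is an illustration under stated assumptions, not a gene-finding method: the input is assumed to be a DNA (e.g., cDNA) sense strand, ATG is taken as the only start codon, the three standard stop codons are used, and the function name `first_coding_sequence` is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard stop codons, DNA alphabet


def first_coding_sequence(dna):
    """Return the first ATG...stop stretch read in frame, or None if absent.

    The scan takes each ATG in 5'-to-3' order and reads codons in that
    frame until an in-frame stop codon is met; the returned string
    includes both the start codon and the stop codon.
    """
    dna = dna.upper()
    start = dna.find("ATG")
    while start != -1:
        for i in range(start + 3, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                return dna[start:i + 3]
        # No in-frame stop for this start codon; try the next ATG.
        start = dna.find("ATG", start + 1)
    return None
```

An open reading frame in the sense defined above may be only a portion of such a stretch, so a sequence without a stop codon can still contain an ORF even though this sketch returns `None` for it.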

As used herein, "epitope" or "antigenic
determinant" means an amino acid sequence that is
immunoreactive. Generally an epitope consists of at
least 3 to 5 amino acids, and more usually, consists of
at least about 8, or even about 10 amino acids. As used
herein, an epitope of a designated polypeptide denotes
epitopes with the same amino acid sequence as the epitope
in the designated polypeptide, and immunologic
equivalents thereof.

An "antigen" is a polypeptide containing one or
more epitopes.

"Immunogenic" means the ability to elicit a 
cellular and/or humoral immune response. An immunogenic 
response may be elicited by immunoreactive polypeptides 
alone, or may require the presence of a carrier in the 
presence or absence of an adjuvant. 

"Immunoreactive" refers to (1) the ability to 
bind immunologically to an antibody and/or to a 
lymphocyte antigen receptor or (2) the ability to be
immunogenic.

An "antibody" is any immunoglobulin, including
antibodies and fragments thereof, that binds a specific
epitope. The term encompasses, inter alia, polyclonal,
monoclonal, and chimeric antibodies. Examples of
chimeric antibodies are discussed in U.S. Patent Nos.
4,816,397 and 4,816,567.

An "antigen set" is defined as a composition
consisting of a plurality of substantially identical
polypeptides, wherein the polypeptides are comprised of
an amino acid sequence of one defined epitope.

"Substantially identical polypeptides" means 
polypeptides that are identical with the exception of
variation limited to the typical range of sequence or
size variation attributable to the polypeptide's method
of production; e.g., recombinant expression, chemical
synthesis, tissue culture, etc. This variation does not
alter the desired functional property of a composition of
substantially identical polypeptides; e.g., the
composition behaves immunologically as a composition of
identical polypeptides. The variations may be due to,
for example, alterations resulting from the secretory
process during transport of the polypeptide, less than
100% efficiency in chemical synthesis, etc.
As used herein, a "variable domain" or "VD" of
a viral protein is a domain that demonstrates a
consistent pattern of amino acid variation between at
least two HCV isolates or subpopulations. Preferably,
the domain contains at least one epitope. Variable
domains can vary from isolate to isolate by as little as
1 amino acid change. These isolates can be from the same
or different HCV group(s) or subgroup(s). Variable
domains can be readily identified through sequence
comparison among isolates, and examples of these
techniques are described below. For the purposes of
describing the present invention, variable domains will
be defined with respect to the amino acid number of the
polyprotein encoded by the genome of HCV-1 as shown in
Figure 9, with the initiator methionine being designated
position 1. The corresponding variable domain in another
HCV isolate is determined by aligning the two isolates'
sequences in a manner that brings the conserved domains
outside any variable domain into maximum alignment. This
can be performed with any of a number of computer
software packages, such as ALIGN 1.0, available from the
University of Virginia, Department of Biochemistry (Attn:
Dr. William R. Pearson). See Pearson et al. (1988),
Proc. Natl. Acad. Sci. USA 85:2444-2448. It is to be
understood that the amino acid numbers given for a
particular variable domain are somewhat subjective and a
matter of choice. Thus, the beginning and end of
variable domains should be understood to be approximate
and to include overlapping domains or subdomains, unless
otherwise indicated.
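
Once two isolates have been aligned as described, locating the corresponding variable domain amounts to translating HCV-1 coordinates through the alignment. The Python sketch below illustrates only that translation step. It assumes the pairwise alignment has already been produced by separate software and is supplied as two gapped strings of equal length; the function name `map_reference_range` and the "-" gap symbol are hypothetical conventions, not part of the original disclosure.

```python
def map_reference_range(ref_aligned, other_aligned, ref_start, ref_end, gap="-"):
    """Map a 1-based inclusive reference range onto the other sequence.

    ref_aligned and other_aligned are the two rows of a pairwise alignment
    (equal length, gaps marked with `gap`). Returns the 1-based inclusive
    (start, end) in the ungapped other sequence covered by reference
    residues ref_start..ref_end, or None if no residue pairs with them.
    """
    if len(ref_aligned) != len(other_aligned):
        raise ValueError("alignment rows must have equal length")
    ref_pos = other_pos = 0
    mapped = []
    for ref_res, other_res in zip(ref_aligned, other_aligned):
        if ref_res != gap:
            ref_pos += 1
        if other_res != gap:
            other_pos += 1
        # Record only columns where both sequences carry a residue.
        if ref_res != gap and other_res != gap and ref_start <= ref_pos <= ref_end:
            mapped.append(other_pos)
    if not mapped:
        return None
    return mapped[0], mapped[-1]
```

With a real alignment, the reference row would be the HCV-1 polyprotein and the range would be a variable domain such as amino acids 384 to 414. Consistent with the caveat above, the returned endpoints should be read as approximate.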

An epitope is the "immunologic equivalent" of 
another epitope in a designated polypeptide when it 
cross-reacts with antibodies which bind immunologically 
to the epitope in the designated polypeptide. 

Epitopes typically are mapped to comprise at 
least about five amino acids, sometimes at least about 8 
amino acids, and even about 10 or more amino acids. 

The amino acid sequence comprising the HCV
epitope may be linked to another polypeptide (e.g., a
carrier protein), either by covalent attachment or by
expressing a fused polynucleotide to form a fusion
protein. If desired, one may insert or attach multiple
repeats of the epitope, and/or incorporate a variety of
epitopes. The carrier protein may be derived from any
source, but will generally be a relatively large,
immunogenic protein such as BSA, KLH, or the like. If
desired, one may employ a substantially full-length HCV
protein as the carrier, multiplying the number of
immunogenic epitopes. Alternatively, the amino acid
sequence from the HCV [illegible] may be linked at the amino
terminus and/or carboxy terminus to a non-HCV amino acid
sequence; thus the polypeptide would be a "fusion
polypeptide". Analogous types of polypeptides may be
constructed using epitopes from other designated viral
proteins.

A "variant" of a designated polypeptide refers
to a polypeptide in which the amino acid sequence of the
designated polypeptide has been altered by the deletion,
substitution, addition or rearrangement of one or more
amino acids in the sequence. Methods by which variants
occur (for example, by recombination) or are made (for
example, by site-directed mutagenesis) are known in the
art.

"Transformation " refers to the insertion of an 
exogenous polynucleotide into a host cell, irrespective
of the method used for the insertion, for example, direct
uptake, transduction (including viral infection), f-
mating or electroporation. The exogenous polynucleotide
may be maintained as a non-integrated vector, for
example, a plasmid or viral genome, or alternatively, may
be integrated into the host genome.

An "individual" refers to a vertebrate,
particularly a member of a mammalian species, and
includes but is not limited to rodents (e.g., mice, rats,
hamsters, guinea pigs), rabbits, goats, pigs, cattle,
sheep, and primates (e.g., chimpanzees, African Green
Monkeys, baboons, orangutans, and humans).

As used herein, "treatment" refers to any of
(i) the prevention of infection or reinfection, as in a
traditional vaccine, (ii) the reduction or elimination of
symptoms, and (iii) the substantial or complete
elimination of the virus. Treatment may be effected
prophylactically (prior to infection) or therapeutically
(following infection).

The term "effective amount" refers to an amount
of epitope-bearing polypeptide sufficient to induce an
immunogenic response in the individual to which it is
administered, or to otherwise detectably immunoreact in
its intended system (e.g., immunoassay). Preferably, the
effective amount is sufficient to effect treatment, as
defined above. The exact amount necessary will vary from
application to application. For vaccine applications or
in the generation of polyclonal antiserum/antibodies, for
example, the effective amount may vary depending on the
species, age, and general condition of the individual,
the severity of the condition being treated, the
particular polypeptide selected and its mode of
administration, etc. It is also believed that effective
amounts will be found within a relatively large,
non-critical range. An appropriate effective amount can be
readily determined using only routine experimentation.

As used herein, a "biological sample" refers to
a sample of tissue or fluid isolated from an individual,
including but not limited to, for example, plasma, serum,
spinal fluid, lymph fluid, the external sections of the
skin, respiratory, intestinal, and genitourinary tracts,
tears, saliva, milk, blood cells, tumors, organs,
biopsies and also samples of in vitro cell culture
constituents (including but not limited to conditioned
medium resulting from the growth of cells in cell culture
medium, e.g., Mab producing myeloma cells, recombinant
cells, and cell components).

The immunoreactive polypeptide compositions of
the present invention comprise a mixture of isolate- or
group-specific epitopes from at least one HCV VD. Thus,
there will be present at least two heterogeneous amino
acid sequences each defining an epitope found in distinct
HCV isolates located in the same or substantially same
physical location in an HCV protein; i.e., each sequence
maps to the same location within the HCV
genome/polypeptide. Since the sequences are
heterogeneous, the location is referred to as a variable
domain (VD).

To better understand the invention, first the
individual amino acid sequences that make up the
compositions of the invention will be explained. Then
the plurality of such sequences which are found in the
compositions of the present invention will be discussed.

The amino acid sequence that characterizes the
polypeptides of the present invention has a basic
structure as follows:

L_y-Z-L'_{y'}    (I)

Z represents the amino acid sequence from a region of a
protein from a selected HCV isolate, where the region
comprises at least one variable domain and the variable
domain comprises at least one epitope. L and L' are
non-HCV amino acid sequences or HCV amino acid sequences
that do not contain a variable domain, and which can be
the same or different. y and y' are 0 or 1 and can be
the same or different. Thus, formula I represents an
amino acid sequence comprising the sequence of an HCV VD,
wherein the VD comprises an epitope.

As discussed above, the epitope(s) in Z will
usually comprise a minimum of about 5 amino acids, more
typically a minimum of about 8 amino acids, and even more
typically a minimum of about 10 amino acids.

The variable domain of Z can comprise more than
one epitope. The variable domain of Z is at least as big
as the combined sequences of the epitopes present, thus
making it typically a minimum of about 5 amino acids when
a single epitope is present. Since epitopes can overlap,
the minimum amino acid sequence for combined epitopes in
the variable domain may be less than the sum of the
individual epitopes' sequences.
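
The point about overlapping epitopes can be illustrated
with a short calculation. The following Python sketch is
an illustration only, not part of the disclosed methods;
the function name and the residue positions are
hypothetical. It merges overlapping epitope ranges and
compares the covered length with the sum of the
individual epitope lengths.

```python
def combined_epitope_span(epitopes):
    """Count residues covered by epitopes given as inclusive (start, end) ranges."""
    merged = []
    for start, end in sorted(epitopes):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous range: extend it instead of adding a new one.
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return sum(end - start + 1 for start, end in merged)


# Hypothetical positions: two 8-residue epitopes sharing 3 residues.
overlapping = [(396, 403), (401, 408)]
covered = combined_epitope_span(overlapping)
summed = sum(end - start + 1 for start, end in overlapping)
print(covered, summed)  # covered length is smaller than the summed lengths
```

With these assumed positions the covered length is 13
residues, whereas the two epitopes sum to 16.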

Z is the amino acid sequence of an HCV isolate
comprising the above-described VD. Thus, the minimum
size of Z is the minimum size of the VD. Z can comprise
more HCV amino acid sequence than just the VD, and can
further comprise more than one VD. The maximum size of Z
is not critical, but obviously cannot exceed the length
of the entire HCV polyprotein. Typically, however, Z
will be the sequence of an entire HCV protein
(particularly E1, E2/NS1, NS2, NS3, NS4 and NS5) or, even
more typically, a fragment of such an HCV protein. Thus,
Z will preferably range from a minimum of about 5 amino
acids (more preferably about 8 or about 10 amino acids
minimum) to a maximum of about 1100 amino acids (more
preferably a maximum of about 500, more preferably a
maximum of about 400 or even more preferably a maximum of
about 200 amino acids). More usually, the
polypeptide of formula I and/or Z, when prepared by,
e.g., chemical synthesis, is a maximum of about 50 amino
acids, more typically a maximum of about 40 amino acids,
and even more typically a maximum of about 30 amino
acids.

The non-HCV amino acid sequences, L and L', if
present, can constitute any of a number of types of such
sequences. For example, L and L' can represent non-HCV
sequences to which Z is fused to facilitate recombinant
expression (e.g., beta-galactosidase, superoxide
dismutase, invertase, alpha-factor, TPA leader, etc.), as
discussed below. Alternatively, L and L' can represent
epitopes of other pathogens, such as hepatitis B virus,
Bordetella pertussis, tetanus toxoid, diphtheria, etc.,
to provide compositions that are immunoreactive relative
to a number of these other pathogens. L and L' can be
amino acid sequences that facilitate attachment to solid
supports during peptide synthesis, immunoassay supports,
vaccine carrier proteins, etc. In fact, L and L' can
even comprise one or more superfluous amino acids with no
functional advantage. There is no critical maximum size
for L or L', the length being generally governed by the
desired function. Typically, L and L' will each be a
maximum of about 2000 amino acids, more typically a
maximum of about 1000 amino acids. The majority of L and
L' sequences with useful properties will be a maximum of
about 500 amino acids. It is desirable, of course, to
select L and L' so as to not block the immunoreactivity
of Z.
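
The structure of formula I, in which the flanking
sequences L and L' are each present (y or y' = 1) or
absent (y or y' = 0), can be sketched in Python as
follows. This is an illustration only; the function name
is hypothetical and the one-letter sequences are
arbitrary placeholders, not HCV or carrier sequences.

```python
def assemble_formula_i(z, l="", l_prime="", y=0, y_prime=0):
    """Concatenate L (if y == 1), Z, and L' (if y' == 1), as in formula I."""
    if y not in (0, 1) or y_prime not in (0, 1):
        raise ValueError("y and y' must each be 0 or 1")
    if not z:
        raise ValueError("Z must contain at least the variable domain")
    return (l if y else "") + z + (l_prime if y_prime else "")


# Placeholder residues only (hypothetical, for illustration).
z_region = "ACDEFGHIKL"
print(assemble_formula_i(z_region))                       # Z alone
print(assemble_formula_i(z_region, l="MK", y=1))          # L-Z
print(assemble_formula_i(z_region, l="MK", l_prime="HHHHHH", y=1, y_prime=1))
```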

The compositions of polypeptides provided
according to the present invention are characterized by
the presence (in an effective amount for
immunoreactivity) within the composition of at least two
amino acid sequences defined as follows by formulas II
and III, respectively:

L_y-Z_1-L'_{y'}    (II)
L_y-Z_2-L'_{y'}    (III)

L, L', y and y' are defined as above, as well as
independently defined for each of formulas II and III.
Z_1 and Z_2 are each HCV amino acid sequences as defined
for Z above encompassing the same variable domain (i.e.,
physical location), but derived from different HCV
isolates having between them at least one heterogeneous
epitope in the common variable domain of Z_1 and Z_2. As
an illustrative example, an amino acid sequence according
to formula II could have as Z_1 a fragment of the
hypervariable domain spanning amino acids 384-414 of
isolate HCV-1 (or more particularly 396-407 or 396-408),
while Z_2 is the analogous fragment from isolate
HCV-J1.1. These two isolates are heterogeneous in this
domain, the amino acid sequences of the epitopes varying
significantly.

It is to be understood that the compositions of
the present invention may comprise more than just two
discrete amino acid sequences according to formula I, and
that the Z sequences may be divided into groups
encompassing different variable domains. For example, a
composition according to the present invention could
comprise a group of HCV sequences (with amino acid
sequences according to formula I) encompassing the
hypervariable domain at amino acids 384-411 from isolates
HCV-1, HCV-J1.1, HC-J1, HC-J4, etc. The composition
could also comprise an additional group of HCV sequences
(within amino acid sequences according to formula I)
encompassing the variable domain at amino acids 215-255
also from isolates HCV-1, HCV-J1.1, HC-J1, HC-J4, etc.
Within the context of the compositions of the present
invention, therefore, the sequence of formula I can be
further defined as follows:

SV_n    (IV)

V represents an amino acid sequence comprising the
sequence of an HCV variable domain, wherein the variable
domain comprises at least one epitope; i.e., formula I.
S and n are integers of 1 or greater. S represents a
particular variable domain, and n represents a particular
isolate. For example, S=1 could represent the variable
domain at amino acids 384-411; S=2 could represent the
variable domain at amino acids 215-255; and n=1, 2, 3 and
4 could represent isolates HCV-1, HCV-J1.1, HC-J1 and
HC-J4, respectively. Thus, the two groups of sequences
discussed above could be represented by:

Group 1: 1V_1, 1V_2, 1V_3 & 1V_4
Group 2: 2V_1, 2V_2, 2V_3 & 2V_4

There are at least two distinct sequences of
formula IV in the compositions according to the present
invention; i.e., the composition contains two different
sequences according to formula IV where the values for S
and/or n are different. For example, at least 1V_1 and
1V_2 are present, or at least 1V_1 and 2V_2 are present,
or at least 1V_1 and 2V_1 are present.
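
The indexing of formula IV can be pictured as a pair of
integers attached to each sequence. The Python sketch
below is an illustration only: the class and function
names are hypothetical, and the residues are placeholders.
It checks the stated requirement that a composition
contain at least two sequences differing in S and/or n.

```python
from typing import NamedTuple


class VSeq(NamedTuple):
    s: int         # variable domain index (S in formula IV)
    n: int         # isolate index (n in formula IV)
    residues: str  # placeholder amino acid sequence


def has_two_distinct(composition):
    """True if at least two members differ in S and/or n."""
    return len({(v.s, v.n) for v in composition}) >= 2


# Placeholder residues only (hypothetical, for illustration).
same_domain_two_isolates = [VSeq(1, 1, "ACDEFGHIKL"), VSeq(1, 2, "ACDQFGHVKL")]
single_sequence_twice = [VSeq(1, 1, "ACDEFGHIKL"), VSeq(1, 1, "ACDEFGHIKL")]

print(has_two_distinct(same_domain_two_isolates))
print(has_two_distinct(single_sequence_twice))
```

The first list corresponds to the combination written
above as the first variable domain from isolates 1 and 2;
the second list repeats one sequence and so does not
satisfy the requirement.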

The distinct sequences falling within formula
IV are present in the composition either on the same or
different polypeptide molecules. Using the minimum
combination of 1V_1 and 1V_2 to illustrate, these two
sequences could be present in the same polypeptide
molecule (e.g., 1V_1-1V_2) or in separate molecules. This
feature of the compositions of the present invention can
be described as compositions of polypeptides as follows:

R_r-(SV_n)_x-R'_{r'}    (V)

wherein S, V and n are as defined above; R and R' are
amino acid sequences of about 1-2000 amino acids, and are
the same or different; r and r' are 0 or 1, and are the
same or different; x is an integer ≥ 1; n is
independently selected for each x; and with the proviso
that amino acid sequences are present in the composition
representing a combination selected from the group
consisting of (i) 1V_1 and 1V_2, (ii) 1V_1 and 2V_2, and
(iii) 1V_1 and 2V_1. In embodiments where the distinct
sequences of formula IV are in different polypeptides, x
can be 1, although it can still be >1 if desired; e.g., a
mixture of polypeptides 1V_1-1V_2 and 1V_1-2V_2. When x
is 1, r and r' are preferably both 0 to avoid redundancy
with L_y and L'_{y'}, since V can be described in a
preferred embodiment by formula I. When x is >1, the
combined lengths of R and the adjacent L, and of R' and
the adjacent L', are preferably no more than the typical
maximum lengths described above for L and L'.

The selection of the HCV amino acid sequences
included within the distinct V sequences of the
compositions will depend upon the intended application of
the sequences and is within the skill of the art in view
of the present disclosure. First, it should be
appreciated that the HCV epitopes of concern to the
present invention can be broken down into two types. The
first type of epitopes are those that are
"group-specific"; i.e., the corresponding epitopes in all
or substantially all isolates within an HCV isolate group
are immunologically cross-reactive with each other, but
not with the corresponding epitopes of substantially all
the isolates of another group. Preferably, the epitopes
in a group-specific class are substantially conserved
within the group, but not between or among the groups.
The second type of epitopes are those that are
"isolate-specific"; i.e., the epitope is immunologically
cross-reactive with substantially identical isolates, and
is not cross-reactive with all or substantially all
distinct isolates.

These group- and isolate-specific epitopes can
be readily identified in view of the present disclosure.
First, the sequences of several HCV isolates are compared,
as described herein, and areas of sequence heterogeneity
identified. The pattern of heterogeneity usually
indicates group or isolate specificity. If an identified
area is known to comprise one or more epitopes, then a
sequence of sufficient size to include the desired
epitope(s) is selected as a variable domain that may
be included in the compositions of the present invention.
If the immunoreactivity of a given heterogeneous area is
not known, peptides representing the sequences found in
that area of the various HCV isolates can be prepared and
screened. Screening can include, but is not limited to,
immunoassays with various sources of anti-HCV antibody
(e.g., patient serum, neutralizing Mabs, etc.) or
generation of antibody and testing the ability of such
antibody to neutralize virus in vitro. Alternatively,
the loci of epitopes identified in a screening protocol,
such as that described below, can be examined for
heterogeneity among various isolates and the
immunological properties of corresponding heterogeneous
sequences screened.
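
The first step described above, comparing isolate
sequences and locating areas of heterogeneity, can be
sketched computationally. The following Python example is
a minimal illustration under stated assumptions, not the
procedure used in this disclosure: it assumes the
sequences are already aligned and of equal length, the
function names and the min_len and max_gap parameters are
hypothetical, and the three sequences are arbitrary
placeholders rather than HCV sequences.

```python
def variable_positions(aligned):
    """Return 0-based positions at which the aligned sequences do not all agree."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be aligned to equal length")
    return [i for i, column in enumerate(zip(*aligned)) if len(set(column)) > 1]


def heterogeneous_regions(aligned, min_len=5, max_gap=2):
    """Group variable positions into regions, tolerating up to max_gap
    conserved residues between them, and keep regions of at least min_len."""
    regions = []
    for pos in variable_positions(aligned):
        if regions and pos - regions[-1][1] <= max_gap + 1:
            regions[-1][1] = pos  # extend the current region
        else:
            regions.append([pos, pos])
    return [(a, b) for a, b in regions if b - a + 1 >= min_len]


# Placeholder alignment (hypothetical sequences, for illustration only).
toy_alignment = [
    "AAAAAKLMNPQAAAAA",
    "AAAAARLMDPEAAAAA",
    "AAAAAKVMNTQAAAAA",
]
print(variable_positions(toy_alignment))
print(heterogeneous_regions(toy_alignment))
```

A region reported by such a scan is only a candidate;
whether it contains an epitope still has to be
established by screening as described above.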
For vaccine applications, it is believed that
variable domains from the E1 and/or E2/NS1 domains will
be of particular interest. In particular, an E1 variable
domain within amino acids 215-255 (see Figure 2), and an
E2/NS1 variable domain within amino acids 384-414 (see
Figure 3), have been identified as being important
immunoreactive domains. The preliminary evidence
suggests that one or both of these domains may be loci of
heterogeneity responsible for escape mutants, leading to
chronic HCV infections. Thus, polypeptide compositions
as described above where the variable domain(s) in V are
one or both of these variable domains are particularly
preferred. Furthermore, the polypeptide compositions of
the present invention, while particularly concerned with
the generally linear epitopes in the variable domains,
may also include conformational epitopes. For example,
the composition can be comprised of a mixture of
recombinant E1 and/or E2/NS1 proteins (exhibiting the
variable domains of different isolates) expressed in a
recombinant system (e.g., insect or mammalian cells) that
maintains conformational epitopes either inside or
outside the variable domain. Alternatively, an E1 and/or
E2/NS1 subunit antigen from a single isolate that
maintains conformational epitopes can be combined with a
polypeptide composition according to the present
invention (e.g., a mixture of synthetic polypeptides or
denatured recombinant polypeptides). In another
preferred application for vaccines, the polypeptide
compositions described herein are combined with other HCV
subunit antigens, such as those described in commonly
owned U.S. S.N. , entitled "Hepatitis C Virus
Asialoglycoproteins" (Attorney Docket No. 0154.002) by
Robert O. Ralston, Frank Marcus, Kent B. Thudium, Barbara
Gervase, and John Hall, filed on even date herewith, and
incorporated herein by reference.

For diagnostic applications, it may be useful to
employ the compositions of the present invention as
antigens, thereby improving the ability to detect
antibody to distinct HCV isolates. Typically the
polypeptide mixtures can be used directly in a homogeneous
or heterogeneous immunoassay format, the latter
preferably comprising immobilizing the polypeptide on a
solid substrate (e.g., microtiter plate wells, plastic
beads, nitrocellulose, etc.). See, e.g., PCT Pub. No.
WO90/11089; EPO Pub. No. 360,088; IMMUNOASSAY: A
PRACTICAL GUIDE, supra. Alternatively, each
substantially identical polypeptide that makes up the
polypeptide composition of the present invention could be
immobilized on the same support at discrete loci, thereby
providing information as to the isolate or group against
which the antibody has been generated. This may be
particularly important in diagnostics if various isolates
cause hepatitis, cancer or other diseases with different
clinical prognoses. A preferred format is the Chiron
RIBA™ strip immunoassay format, described in commonly
owned U.S. S.N. 07/138,894 and U.S. S.N. 07/456,637, the
disclosures of which are incorporated herein by
reference.

Polypeptides useful in the manufacture of the
compositions of the present invention can be made
recombinantly, synthetically or in tissue culture.
Recombinant polypeptides comprised of the truncated HCV
sequences or full-length HCV proteins can be made up
entirely of HCV sequences (one or more epitopes, either
contiguous or noncontiguous), or sequences in a fusion
protein. In fusion proteins, useful heterologous
sequences include sequences that provide for secretion
from a recombinant host, enhance the immunological
reactivity of the HCV epitope(s), or facilitate the
coupling of the polypeptide to a support or a vaccine
carrier. See, e.g., EPO Pub. No. 116,201; U.S. Pat. No.
4,722,840; EPO Pub. No. 259,149; U.S. Pat. No. 4,629,783,
the disclosures of which are incorporated herein by
reference.

Full-length polypeptides, as well as polypeptides
comprised of truncated HCV sequences, and mutants thereof,
may be prepared by chemical synthesis. Methods of preparing
polypeptides by chemical synthesis are known in the art.
They may also be prepared by recombinant technology. A
DNA sequence encoding HCV-1, as well as DNA sequences of
variable regions from other HCV isolates, have been
described and/or referenced herein. The availability of
these sequences permits the construction of
polynucleotides encoding immunoreactive regions of HCV
polypeptides.

Polynucleotides encoding the desired
polypeptide comprised of one or more of the
immunoreactive HCV epitopes from a variable domain of HCV
may be chemically synthesized or isolated, and inserted
into an expression vector. The vectors may or may not
contain portions of fusion sequences such as
beta-galactosidase or superoxide dismutase (SOD). Methods
and vectors which are useful for the production of
polypeptides which contain fusion sequences of SOD are
described in European Patent Office Publication number
0196056, published October 1, 1986.

The DNA encoding the desired polypeptide,
whether in fused or mature form and whether or not
containing a signal sequence to permit secretion, may be
ligated into expression vectors suitable for any
convenient host. The hosts are then transformed with the
expression vector. Both eukaryotic and prokaryotic host
systems are presently used in forming recombinant
polypeptides, and a summary of some of the more common
control systems and host cell lines is presented infra.
The host cells are incubated under conditions which allow
expression of the desired polypeptide. The polypeptide
is then isolated from lysed cells or from the culture
medium and purified to the extent needed for its intended
use.

The general techniques used in extracting the
HCV genome from a virus, preparing and probing DNA
libraries, sequencing clones, constructing expression
vectors, transforming cells, performing immunological
assays such as radioimmunoassays and ELISA assays,
growing cells in culture, and the like, are known in the
art. (See, e.g., the references cited in the "Background"
section, above, as well as the references cited at the
beginning of this "Modes of Practicing the Invention"
section, above.)

Transformation of the vector containing the
desired sequence into the appropriate host may be by any
known method for introducing polynucleotides into a host
cell, including, for example, packaging the
polynucleotide in a virus and transducing the host cell
with the virus, or by direct uptake of the
polynucleotide. The transformation procedure used
depends upon the host to be transformed. Bacterial
transformation by direct uptake generally employs
treatment with calcium or rubidium chloride (Cohen
(1972), Proc. Natl. Acad. Sci. USA 69:2110). Yeast
transformation by direct uptake may be carried out using
the method of Hinnen et al. (1978), J. Adv. Enzyme
Reg. 2:1929. Mammalian transformations by direct uptake
may be conducted using the calcium phosphate
precipitation method of Graham and Van der Eb (1978),
Virology 52:546, or the various known modifications
thereof. Other methods for the introduction of
recombinant polynucleotides into cells, particularly into
mammalian cells, which are known in the art include
dextran mediated transfection, calcium phosphate mediated
transfection, polybrene mediated transfection, protoplast
fusion, electroporation, encapsulation of the
polynucleotide(s) in liposomes, and direct microinjection
of the polynucleotides into nuclei.

In order to obtain expression of desired coding
sequences, host cells are transformed with
polynucleotides (which may be expression vectors), which
are comprised of control sequences operably linked to the
desired coding sequences. The control sequences are
compatible with the designated host. Among prokaryotic
hosts, E. coli is most frequently used. Expression
control sequences for prokaryotes include promoters,
optionally containing operator portions, and ribosome
binding sites. Transfer vectors compatible with
prokaryotic hosts are commonly derived from, for example,
pBR322, a plasmid containing operons conferring
ampicillin and tetracycline resistance, and the various
pUC vectors, which also contain sequences conferring
antibiotic resistance markers. Promoter sequences may be
naturally occurring, for example, the β-lactamase
(penicillinase) (Weissman (1981), "The cloning of
interferon and other mistakes" in Interferon 3 (ed. I.
Gresser)), lactose (lac) (Chang et al. (1977), Nature
198:1056) and tryptophan (trp) (Goeddel et al. (1980),
Nucl. Acids Res. 8:4057), and lambda-derived P_L promoter
system and N gene ribosome binding site (Shimatake et al.
(1981), Nature 292:128). In addition, synthetic
promoters which do not occur in nature also function as
bacterial promoters. For example, transcription
activation sequences of one promoter may be joined with
the operon sequences of another promoter, creating a
synthetic hybrid promoter (e.g., the tac promoter, which
is derived from sequences of the trp and lac promoters
(De Boer et al. (1983), Proc. Natl. Acad. Sci. USA
80:21)). The foregoing systems are particularly
compatible with E. coli; if desired, other prokaryotic
hosts such as strains of Bacillus or Pseudomonas may be
used, with corresponding control sequences.
Eukaryotic hosts include yeast and mammalian
cells in culture systems. Saccharomyces cerevisiae and
Saccharomyces carlsbergensis are the most commonly used
yeast hosts, and are convenient fungal hosts. Yeast
compatible vectors generally carry markers which permit
selection of successful transformants by conferring
prototrophy to auxotrophic mutants or resistance to heavy
metals on wild-type strains. Yeast compatible vectors
may employ the 2 micron origin of replication (Broach et
al. (1983), Meth. Enz. 101:307), the combination of CEN3
and ARS1 or other means for assuring replication, such as
sequences which will result in incorporation of an
appropriate fragment into the host cell genome. Control
sequences for yeast vectors are known in the art and
include promoters for the synthesis of glycolytic enzymes
(Hess et al. (1958), J. Adv. Enzyme Reg. 2:149); for
example, alcohol dehydrogenase (ADH) (E.P.O. Publication
No. 284044), enolase, glucokinase, glucose-6-phosphate
isomerase, glyceraldehyde-3-phosphate dehydrogenase (GAP
or GAPDH), hexokinase, phosphofructokinase,
3-glycerophosphate mutase, and pyruvate kinase (PyK) (E.P.O.
Publication No. 329203). The yeast PHO5 gene, encoding
acid phosphatase, also provides useful promoter
sequences. In addition, synthetic promoters which do not
occur in nature also function as yeast promoters. For
example, upstream activating sequences (UAS) of one yeast
promoter may be joined with the transcription activation
region of another yeast promoter, creating a synthetic
hybrid promoter. Examples of such hybrid promoters
include the ADH regulatory sequence linked to the GAP
transcription activation region (U.S. Patent Nos.
4,876,197 and 4,880,734). Other examples of hybrid
promoters include promoters which consist of the
regulatory sequences of either the ADH2, GAL4, GAL10, or
PHO5 genes, combined with the transcriptional activation
region of a glycolytic enzyme gene such as GAP or PyK
(E.P.O. Publication No. 164556). Furthermore, a yeast
promoter can include naturally occurring promoters of
non-yeast origin that have the ability to bind yeast RNA
polymerase for the appropriate initiation of
transcription.

Other control elements which may be included in
the yeast expression vector are terminators (e.g., from
GAPDH, and from the enolase gene (Holland (1981), J.
Biol. Chem. 256:1385)), and leader sequences. The leader
sequence fragment typically encodes a signal peptide
comprised of hydrophobic amino acids which direct the
secretion of the protein from the cell. DNA encoding
suitable signal sequences can be derived from genes for
secreted yeast proteins, such as the yeast invertase gene
(E.P.O. Publication No. 12,873) and the α-factor gene
(U.S. Patent No. 4,588,684). Alternatively, leaders of
non-yeast origin, such as an interferon leader, also
provide for secretion in yeast (E.P.O. Publication No.
60057). A preferred class of secretion leaders are those
that employ a fragment of the yeast α-factor gene, which
contains both a "pre" signal sequence, and a "pro"
region. The types of α-factor fragments that can be
employed include the full-length pre-pro α-factor leader,
as well as truncated α-factor leaders (U.S. Patent Nos.
4,546,083 and 4,870,008; E.P.O. Publication No. 324274).
Additional leaders employing an α-factor leader fragment
that provides for secretion include hybrid α-factor
leaders made with a pre-sequence of a first yeast, but a
pro-region from a second yeast α-factor. (See, e.g.,
P.C.T. WO 89/02463).

Expression vectors, either extrachromosomal
replicons or integrating vectors, have been developed for
transformation into many yeasts. For example, expression
vectors have been developed for Candida albicans (Kurtz
et al. (1986), Mol. Cell Biol. 6:142), Candida maltosa
(Kunze et al. (1985), J. Basic Microbiol. 31:141),
Hansenula polymorpha (Gleeson et al. (1986), J. Gen.
Microbiol. 122:3459), Kluyveromyces fragilis (Das et al.
(1984), J. Bacteriol. 158:1165), Kluyveromyces lactis (De
Louvencourt et al. (1983), J. Bacteriol. 151:737), Pichia
guilliermondii (Kunze et al. (1985), supra), Pichia
pastoris (Cregg et al. (1985), Mol. Cell. Biol. 5:3376;
U.S. Patent Nos. 4,837,148 and 4,929,555),
Schizosaccharomyces pombe (Beach and Nurse (1981), Nature
300:706), and Yarrowia lipolytica (Davidow et al. (1985),
Curr. Genet. 10:39).

Mammalian cell lines available as hosts for
expression are known in the art and include many
immortalized cell lines available from the American Type
Culture Collection (ATCC), including, for example, HeLa
cells, Chinese hamster ovary (CHO) cells, baby hamster
kidney (BHK) cells, COS monkey cells, and a number of
other cell lines. Suitable promoters for mammalian cells
are also known in the art and include viral promoters
such as that from Simian Virus 40 (SV40), Rous sarcoma
virus (RSV), adenovirus (ADV) and bovine papilloma virus
(BPV) (see Sambrook (1989) for examples of suitable
promoters). Mammalian cells may also require terminator
sequences and poly A addition sequences; enhancer
sequences which increase expression may also be included,
and sequences which cause amplification of the gene may
also be desirable. These sequences are known in the art.

Vectors suitable for replication in mammalian
cells are known in the art, and may include viral
replicons, or sequences which ensure integration of the
appropriate sequences encoding the desired polypeptides
into the host genome.

A vector which is used to express foreign DNA
and which may be used in vaccine preparation is Vaccinia
virus. In this case, the heterologous DNA is inserted
into the Vaccinia genome. Techniques for the insertion
of foreign DNA into the vaccinia virus genome are known
in the art, and utilize, for example, homologous
recombination. The insertion of the heterologous DNA is
generally into a gene which is non-essential in nature,
for example, the thymidine kinase gene, which also
provides a selectable marker. Plasmid vectors that
greatly facilitate the construction of recombinant
viruses have been described (see, for example, Mackett et
al. (1984) in "DNA Cloning", Vol. II, IRL Press, p. 191;
Chakrabarti et al. (1985), Mol. Cell Biol. 5:3403; Moss
(1987) in "Gene Transfer Vectors for Mammalian Cells"
(Miller and Calos, eds.), p. 10). Expression of the
desired polypeptides comprised of immunoreactive regions
then occurs in cells or individuals which are infected
and/or immunized with the live recombinant vaccinia
virus.

Other systems for expression of polypeptides
include insect cells and vectors suitable for use in
these cells. These systems are known in the art, and
include, for example, insect expression transfer vectors
derived from the baculovirus Autographa californica
nuclear polyhedrosis virus (AcNPV), which is a
helper-independent, viral expression vector. Expression
vectors derived from this system usually use the strong
viral polyhedron gene promoter to drive expression of
heterologous genes. Currently the most commonly used
transfer vector for introducing foreign genes into AcNPV
is pAc373. Many other vectors, known to those of skill
in the art, have also been designed for improved
expression. These include, for example, pVL985 (which
alters the polyhedron start codon from ATG to ATT, and
which introduces a BamHI cloning site 32 basepairs
downstream from the ATT; see Luckow and Summers (1989),
Virology 12:31). Good expression of nonfused foreign
proteins usually requires foreign genes that ideally have
a short leader sequence containing suitable translation
initiation signals preceding an ATG start signal. The
plasmid also contains the polyhedron polyadenylation
signal and the ampicillin-resistance (amp) gene and
origin of replication for selection and propagation in E.
coli.

Methods for the introduction of heterologous
DNA into the desired site in the baculovirus are known in
the art. (See Summers and Smith, Texas Agricultural
Experiment Station Bulletin No. 1555; Ju et al. (1987),
in "Gene Transfer Vectors for Mammalian Cells" (Miller and
Calos, eds.); Smith et al. (1983), Mol. & Cell. Biol.
2:2156; and Luckow and Summers (1989), supra). For
example, the insertion can be into a gene such as the
polyhedrin gene, by homologous recombination; insertion
can also be into a restriction enzyme site engineered
into the desired baculovirus gene. The inserted
sequences may be those which encode all or varying
segments of the desired HCV polypeptides including at
least one epitope from a variable domain.

The signals for posttranslational
modifications, such as signal peptide cleavage,
proteolytic cleavage, and phosphorylation, appear to be
recognized by insect cells. The signals required for
secretion and nuclear accumulation also appear to be
conserved between the invertebrate and vertebrate cells.
Examples of the signal sequences from vertebrate cells
which are effective in invertebrate cells are known in
the art; for example, the human interleukin 2 signal
(IL2s), which is a signal for transport out of the cell,
is recognized and properly removed in insect cells.

It is often desirable that the polypeptides
prepared using the above host cells and vectors be fusion
polypeptides. As with non-fusion polypeptides, fusion
polypeptides may remain intracellular after expression.
Alternatively, fusion proteins can also be secreted from
the cell into the growth medium if they are comprised of
a leader sequence fragment. Preferably, there are
processing sites between the leader fragment and the
remainder of the foreign gene that can be cleaved either
in vivo or in vitro.

In cases where the composition is to be used
for treatment of HCV, it is desirable that the
composition be immunogenic. In instances wherein the
synthesized polypeptide is correctly configured so as to
provide the correct epitope, but is too small to be
immunogenic, the polypeptide may be linked to a suitable
carrier. A number of techniques for obtaining such
linkage are known in the art, including the formation of
disulfide linkages using N-succinimidyl-3-(2-pyridylthio)-
propionate (SPDP) and succinimidyl 4-(N-maleimidomethyl)-
cyclohexane-1-carboxylate (SMCC) (if the
peptide lacks a sulfhydryl group, this can be provided by
addition of a cysteine residue). These reagents create a
disulfide linkage between themselves and peptide cysteine
residues on one protein and an amide linkage through the
ε-amino on a lysine, or other free amino group in other
amino acids. A variety of such disulfide/amide-forming
agents are known. See, for example, Immun. Rev. (1982)
62:185. Other bifunctional coupling agents form a
thioether rather than a disulfide linkage. Many of these
thioether-forming agents are commercially available and
include reactive esters of 6-maleimidocaproic acid,
2-bromoacetic acid, 2-iodoacetic acid,
4-(N-maleimidomethyl)cyclohexane-1-carboxylic acid, and the like. The
carboxyl groups can be activated by combining them with
succinimide or 1-hydroxyl-2-nitro-4-sulfonic acid, sodium
salt. Additional methods of coupling antigens employ the
rotavirus/"binding peptide" system described in EPO
Publication No. 259,149. The foregoing list is not meant
to be exhaustive, and modifications of the named
compounds can clearly be used.

Any carrier may be used which does not itself
induce the production of antibodies harmful to the host.
Suitable carriers are typically large, slowly metabolized
macromolecules such as proteins; polysaccharides such as
latex functionalized sepharose, agarose, cellulose,
cellulose beads and the like; polymeric amino acids, such
as polyglutamic acid, polylysine, and the like; amino
acid copolymers; and inactive virus particles (see
infra). Especially useful protein substrates are serum
albumins, keyhole limpet hemocyanin, immunoglobulin
molecules, thyroglobulin, ovalbumin, tetanus toxoid, and
other proteins well known to those of skill in the art.

The immunogenicity of the epitopes of the HCV
variable domains, particularly of E1 and E2/NS1, may also
be enhanced by preparing them in eukaryotic systems fused
with or assembled with particle-forming proteins such as,
for example, that associated with hepatitis B surface
antigen. See, e.g., U.S. Patent No. 4,722,840.
Constructs wherein the polypeptide containing the HCV
epitope from a variable domain is linked directly to the
particle-forming protein coding sequences produce
hybrids which are immunogenic with respect to the HCV
epitope. In addition, all of the vectors prepared
include epitopes specific to HBV, having various degrees
of immunogenicity, such as, for example, the pre-S
peptide. Thus, particles constructed from particle-forming
protein which include HCV sequences are
immunogenic with respect to HCV and HBV.

Hepatitis B surface antigen (HBSAg) has been
shown to be formed and assembled into particles in S.
cerevisiae (Valenzuela et al. (1982), Nature 298:344), as
well as in, for example, mammalian cells (Valenzuela et
al. (1984), in "Hepatitis B", Millman I. et al., ed.).
The formation of such particles has been shown to enhance
the immunogenicity of the monomer subunit. The
constructs may also include the immunodominant epitope of
HBSAg, comprising the 55 amino acids of the presurface
(pre-S) region. Neurath et al. (1984). Constructs of
the pre-S-HBSAg particle expressible in yeast are
disclosed in E.P.O. Publication No. 174,444; hybrids
including heterologous viral sequences for yeast
expression are disclosed in E.P.O. Publication No.
175,261. These constructs may also be expressed in
mammalian cells such as CHO cells using an SV40-
dihydrofolate reductase vector (Michelle et al. (1984)).

In addition, portions of the particle-forming
protein coding sequence may be replaced with codons
encoding an epitope from an HCV variable domain. In this
replacement, regions which are not required to mediate
the aggregation of the units to form immunogenic
particles in yeast or mammals can be deleted, thus
eliminating additional HBV antigenic sites from
competition with the HCV epitope(s).

The preparation of vaccines which contain an
immunogenic polypeptide(s) as an active ingredient(s) is
known to one skilled in the art. Typically, such
vaccines are prepared as injectables, either as liquid
solutions or suspensions; solid forms suitable for
solution in, or suspension in, liquid prior to injection
may also be prepared. The preparation may also be
emulsified, or the polypeptide(s) encapsulated in
liposomes. The active immunogenic ingredients are often
mixed with excipients which are pharmaceutically
acceptable and compatible with the active ingredient.
Suitable excipients are, for example, water, saline,
dextrose, glycerol, ethanol, or the like and combinations
thereof. In addition, if desired, the vaccine may
contain minor amounts of auxiliary substances such as
wetting or emulsifying agents, pH buffering agents,
and/or adjuvants which enhance the effectiveness of the
vaccine. Examples of adjuvants which may be effective
include, but are not limited to: aluminum hydroxide,
N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP),
N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637,
referred to as nor-MDP), N-acetylmuramyl-L-alanyl-D-
isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-
3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred
to as MTP-PE), and RIBI, which contains three components
extracted from bacteria, monophosphoryl lipid A,
trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS)
in a 2% squalene/Tween 80 emulsion. The effectiveness of
an adjuvant may be determined by measuring the amount of
antibodies directed against an immunogenic polypeptide
containing an HCV epitope from a variable domain, the
antibodies resulting from administration of this
polypeptide in vaccines which are also comprised of the
various adjuvants.

The proteins may be formulated into the vaccine
as neutral or salt forms. Pharmaceutically acceptable
salts include the acid addition salts (formed with free
amino groups of the peptide) which are formed with
inorganic acids such as, for example, hydrochloric or
phosphoric acids, or organic acids such as acetic,
oxalic, tartaric, maleic, and the like. Salts formed
with the free carboxyl groups may also be derived from
inorganic bases such as, for example, sodium, potassium,
ammonium, calcium, or ferric hydroxides, and such organic
bases as isopropylamine, trimethylamine, 2-ethylamino
ethanol, histidine, procaine, and the like.

The vaccines are conventionally administered
parenterally, by injection, for example, either
subcutaneously or intramuscularly. Additional
formulations which are suitable for other modes of
administration include suppositories and, in some cases,
oral formulations. For suppositories, traditional
binders and carriers may include, for example,
polyalkylene glycols or triglycerides; such suppositories
may be formed from mixtures containing the active
ingredient in the range of 0.5% to 10%, preferably 1%-2%.
Oral formulations include such normally employed
excipients as, for example, pharmaceutical grades of
mannitol, lactose, starch, magnesium stearate, sodium
saccharine, cellulose, magnesium carbonate, and the like.
These compositions take the form of solutions,
suspensions, tablets, pills, capsules, sustained release
formulations or powders and contain 10%-95% of active
ingredient, preferably 25%-70%.

In addition to the above, it is also possible
to prepare live vaccines of attenuated microorganisms
which express recombinant polypeptides of the HCV antigen
sets. Suitable attenuated microorganisms are known in
the art and include, for example, viruses (e.g., vaccinia
virus) as well as bacteria.

The vaccines are administered in a manner
compatible with the dosage formulation, and in such
amount as will be prophylactically and/or therapeutically
effective. The quantity to be administered, which is
generally in the range of 5 µg to 250 µg of antigen per
dose, depends on the subject to be treated, capacity of
the subject's immune system to synthesize antibodies, and
the degree of protection desired. Precise amounts of
active ingredient required to be administered may depend
on the judgment of the practitioner and may be peculiar
to each individual.

The vaccine may be given in a single dose
schedule, or preferably in a multiple dose schedule. A
multiple dose schedule is one in which a primary course
of vaccination may be with 1-10 separate doses, followed
by other doses given at subsequent time intervals
required to maintain and/or reinforce the immune
response, for example, at 1-4 months for a second dose,
and if needed, a subsequent dose(s) after several months.
The dosage regimen will also, at least in part, be
determined by the need of the individual and be dependent
upon the judgment of the practitioner.

In addition, the vaccine containing the antigen
sets comprised of HCV polypeptides described above may
be administered in conjunction with other
immunoregulatory agents, for example, immune globulins.

The compositions of the present invention can
be administered to individuals to generate polyclonal
antibodies (purified or isolated from serum using
conventional techniques) which can then be used in a
number of applications. For example, the polyclonal
antibodies can be used to passively immunize an
individual, or as immunochemical reagents.

In another embodiment of the invention, the
above-described immunoreactive compositions comprised of
a plurality of HCV antigen sets are used to detect
anti-HCV antibodies within biological samples, including
for example, blood or serum samples. Design of the
immunoassays is subject to a great deal of variation, and
a variety of these are known in the art. However, the
immunoassay will use antigen sets wherein each antigen
set consists of a plurality of substantially identical
polypeptides comprising the amino acid sequence of an
epitope within a first variable domain of an HCV isolate,
and the amino acid sequence of one set is heterogeneous
with respect to the amino acid sequence of at least one
other set. Protocols for the immunoassay may be based,
for example, upon competition, or direct reaction, or
sandwich type assays. Protocols may also, for example,
use solid supports, or may be by immunoprecipitation.
Most assays involve the use of labeled antibody or
polypeptide; the labels may be, for example, fluorescent,
chemiluminescent, radioactive, or dye molecules. Assays
which amplify the signals from the probe are also known;
examples are assays which utilize biotin and
avidin, and enzyme-labeled and mediated immunoassays,
such as ELISA assays.

Kits suitable for immunodiagnosis and containing
the appropriate labeled reagents are constructed by
packaging the appropriate materials, including the
compositions of the invention containing HCV epitopes
from variable domains, in suitable containers, along with
the remaining reagents and materials (for example,
suitable buffers, salt solutions, etc.) required for the
conduct of the assay, as well as a suitable set of assay
instructions.

Described below are examples of the present
invention which are provided only for illustrative
purposes, and not to limit the scope of the present
invention. In light of the present disclosure, numerous
embodiments within the scope of the claims will be apparent
to those of ordinary skill in the art.

Examples

In the Examples the following materials and
methods were used.

Patient Samples and RNA Extraction

Asymptomatic HCV carriers HCT 18 and HCV J1 and
chronically infected HCV patient Th have been previously
described in Weiner et al. (1991) Virol. 180:842-848.
Patient Q was diagnosed with chronic active hepatitis
based on a liver biopsy and was placed on alfa-2b
interferon therapy (3 million units, thrice weekly) for
six months. RNA from 0.2 ml of plasma was extracted
according to the method of Chomczynski and Sacchi, (1987)
Anal. Biochem. 162:156-159, using RNAzol™ B reagent
(Cinna/Biotecx Laboratories) containing 10 ng/ml MS2
carrier RNA (Boehringer Mannheim, 165-948) as indicated
by the manufacturer. RNA was resuspended in 200 µl of
diethyl pyrocarbonate treated distilled water and
reprecipitated in a final concentration of 0.2 M sodium
acetate and two and one half volumes of 100% ethanol
(-20°C).

cDNA and Polymerase Chain Reactions

All reactions were performed according to
Weiner et al. (1990) Lancet 115:1-5. M13 sequencing was
performed according to Messing et al. (1983), Methods in
Enzymology 101:20-37. The consensus sequence of at least
four cloned inserts is presented, with the exception of
the HCV J1.2 E2/NS1 sequence, which was derived from two
clones.

Cloning and sequencing of HCT 18 and Th was as
reported in Weiner et al. (1991), supra. Nested PCR
primers used to clone the amino terminal and carboxy
proximal segments of E2/NS1 in patient Q were:



PCR I
X(E2)14       GGTGCTCACTGGGGAGTCCT (1367-1386) S
X(E2)18J      CATTGCAGTTCAGGGCCGTGCTA (1608-1588) A
PCR II
X(E2)4        TCCATGGTGGGGAACTGGGC (1406-1425) S
X(E2)19J      TGCCAACTGCCATTGGTGTT (1582-1562) A;

PCR I
X(E2)14       (above) S
J1rc12        TAACGGGCTGAGCTCGGA (2313-2296) A
PCR II
US(E2)5       CAATTGGTTCGGTTGTACC (1960-1978) S
J1rc13        CGTCCAGTTGCAGGCAGCTTC (2260-2240) A.

PCR primers used to clone the HCV J1 E2/NS1 gene were:

PCR I
J1(E2)14      (above) S
J1(E2)rc30**  CAGGGCAGTATCTGCCACTC (2349-2330) A
J1IZ-2*       TGAGACGGACGTGCTGCTCCT (1960-1978) S
J1(E2)rc32**  TTTGATGTACCAGGCGGCGCA (2658-2636) A
PCR II
E2384.5*      GGATCCGCTAGCCATACCCGCGTGACGGGGGGGGTGCAA (1469-1495) S
DSCON1JBX*    GGATCCTCTAGATTACTCTTCTGACCrATCCCTGTCCTCC^AGTCACA (2272-2301) A
J1IZ-1*       CAACTGGTTCGGCTGTACA (1915-1935) S
J1(E2)rc31**  (2566-2546) A.

* nt sequence from Takeuchi et al., (1990) Nucl. Acids
Res. 18:4626; ** nt sequence from Kato et al., (1989)
Proc. Jpn. Acad. 65B:219-223. Sense (S) or antisense (A)
PCR primers are given in the 5' to 3' orientation
according to nucleotide numbers in the reference.
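
As a rough illustration of how the nucleotide coordinates in the primer list can be read, the following Python sketch computes the region bounded by each nested primer pair for the amino terminal segment of E2/NS1 in patient Q, and shows how an antisense primer relates to the sense strand. The function names are hypothetical and are not part of the original methods; the coordinates are those printed in the list, and the calculation assumes that both primers of a pair are numbered on the same reference sequence.

```python
# Illustrative sketch only: helper names are hypothetical, and the
# coordinates are copied from the primer list for patient Q.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def amplicon_span(sense_coords, antisense_coords):
    """Return (start, end, length) of the region bounded by a primer pair.

    Sense primers are listed with ascending coordinates and antisense
    primers with descending coordinates, so the outer bounds are the
    smallest sense coordinate and the largest antisense coordinate.
    """
    start = min(sense_coords)
    end = max(antisense_coords)
    return start, end, end - start + 1


# First round: X(E2)14 (sense) with X(E2)18J (antisense).
pcr1 = amplicon_span((1367, 1386), (1608, 1588))
# Second (nested) round: X(E2)4 (sense) with X(E2)19J (antisense).
pcr2 = amplicon_span((1406, 1425), (1582, 1562))

# The nested product should lie entirely within the first-round product.
is_nested = pcr1[0] <= pcr2[0] and pcr2[1] <= pcr1[1]

# Sense-strand sequence that the antisense primer X(E2)19J anneals to.
x_e2_19j_target = reverse_complement("TGCCAACTGCCATTGGTGTT")
```

Under these assumptions the first-round product spans nucleotides 1367 to 1608 and the nested product spans nucleotides 1406 to 1582, so the second pair amplifies a fragment internal to the first.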

Synthesis of Biotinylated Peptides

The overlapping octapeptides for the
hypervariable regions of three strains of HCV were
synthesized on cleavable-linker, derivatized,
polyethylene pins essentially as described by (Maeji et
al., (1990) J. Immunol. Methods 134:23-33, was coupled to
the N-terminus of each peptide. Finally, biotin was
coupled to the N-terminus using 150 µl of a
dimethylformamide solution containing 40 mM biotin, 40 mM
1-hydroxybenzotriazole (HOBt), 40 mM
benzotriazole-1-yl-oxy-tris-pyrrolidino-phosphonium
hexafluorophosphate (PyBOP, NOVABIOCHEM) and 60 mM
N-methylmorpholine (NMM), reacting overnight at 20°C.

After biotinylation, the peptides were
side-chain deprotected and washed, and the peptide from each
pin was cleaved in 200 µl of 0.1 M phosphate buffer (pH
7.2). Microtitre plates containing the cleaved peptide
solutions were stored at -20°C until needed.

ELISA Testing of Biotinylated Peptides

Polystyrene plates (Nunc immuno plate maxisorb
F96) were coated with streptavidin by incubating
overnight at 4°C with 0.1 ml/well of a 5 µg/ml solution
of streptavidin (Sigma Cat. No. S4762) in 0.1 M carbonate
buffer at pH 9.6. After removal of the streptavidin
solution, the wells were washed four times with a 0.1%
solution of Tween 20 in PBS. Nonspecific binding was
blocked by incubating each well with 0.2 ml of 2% BSA in
PBS for 1 h at 20°C. The wells were again washed four
times with PBS/Tween 20. Plates were air-dried and
stored at 4°C until required. The streptavidin in each
well was coupled to cleaved peptides by incubation with
100 µl of a 1:100 dilution of cleaved peptide solution
with 0.1% BSA in PBS containing 0.1% sodium azide for 1 h
at 20°C. After incubation, the plate was washed four
times with PBS/Tween 20. Each well was incubated with
100 µl of a suitable dilution of serum (diluted with 2%
BSA in PBS containing 0.1% sodium azide) for 1 h at 20°C
or overnight at 4°C followed by four washes with
PBS/Tween 20. Bound antibody was detected by reaction
for 1 h at 20°C in 0.1 ml conjugate. This consisted of
0.25 ml/l (a saturating level) of horseradish peroxidase-
labeled goat anti-rabbit IgG (H+L) (Kirkegaard and Perry
Labs, Gaithersburg, MD) in CASS (0.1% sheep serum, 0.1%
Tween 20, 0.1% sodium caseinate diluted in 0.1 M PBS, pH
7.2). The wells were washed two times with PBS/Tween 20
followed by two washes with PBS only. The presence of
enzyme was detected by reaction for 45 min at 20°C with
0.1 ml of a freshly-prepared solution containing 50 mg of
ammonium 2,2'-azino-bis[3-ethylbenzothiazoline-
6-sulphonate] (ABTS, Boehringer Mannheim Cat. no. 122661)
and 0.03 ml of 35% (w/w) hydrogen peroxide solution in
100 ml of 0.1 M phosphate/0.08 M citrate buffer, pH 4.0.
Color development was measured in a Titertek Multiscan MC
plate reader in the dual wavelength mode at 405 nm
against a reference wavelength of 492 nm.

Computer Generated Antigenicity Profile

Antigenicity profiles for the HCV E2/NS1
protein and HIV-1 gp120 hypervariable region V3 (aa 303-
338) were derived from a computer program based on the
degree of sequence variability as originally proposed by
Kabat [Sequences of proteins of immunological interest.
U.S. Department of Health and Human Services, Public
Health Service, National Institutes of Health (1983)] for
the identification of the hypervariable loops of
immunoglobulins, multiplied by the average of the
individual probabilities that antibody binding is retained
for each possible pair-wise amino acid change. Probabilities
for retention of antibody binding associated with a given
amino acid change were the values experimentally
determined by assessing the effects on antibody binding
of all possible amino acid substitutions for 103
characterized linear epitopes. Geysen et al., (1988) J.
Mol. Rec. 1:32-41. This algorithm thus weights the
variability index to give more significance to amino acid
changes likely to have a significant effect on antibody
binding, i.e., compensates for conservative amino acid
changes. Fifteen HCV sequences [HCV-1, Q3.2, HCT 23,
EC10, HC-J1, HCVE1, TH, HCT 27, Q1.2, HCT18, HC-J4, HCV
J1.2/HCV J1.1, HCV J, HCV BK] were used to determine
the antigenicity profile for HCV. The HIV-1 V3 profile
was obtained by averaging 242 individual profiles of 15
sequences selected at random from the numerically greater
data base of unique HIV-1 sequences. LaRosa et al.,
(1990) Science 249:932-935 & Correction in Science (1991)
p. 811. The amino acid sequences of some of these
isolates between aa 384 and 420 are shown in Figure 3.
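
The unweighted part of this calculation can be sketched in Python as follows. This is only a minimal illustration of a Kabat-style variability index (number of different residues at a position divided by the frequency of the most common residue); it does not include the antibody-binding weights of Geysen et al., which are not reproduced here, and the four short aligned sequences are invented toy data rather than any of the fifteen HCV sequences.

```python
from collections import Counter


def kabat_variability(column):
    """Kabat-style variability for one alignment column.

    Computed as (number of distinct residues) divided by
    (frequency of the most common residue), with the frequency
    expressed as a fraction of the sequences compared.
    """
    counts = Counter(column)
    most_common_fraction = counts.most_common(1)[0][1] / len(column)
    return len(counts) / most_common_fraction


def variability_profile(aligned_sequences):
    """Return the variability index at every position of an alignment."""
    length = len(aligned_sequences[0])
    if any(len(seq) != length for seq in aligned_sequences):
        raise ValueError("all sequences must have the same aligned length")
    return [
        kabat_variability([seq[i] for seq in aligned_sequences])
        for i in range(length)
    ]


# Hypothetical toy alignment (not taken from Figure 3).
toy_alignment = ["TARF", "TAGF", "TVRF", "TARL"]
profile = variability_profile(toy_alignment)
```

An invariant position gives the minimum value of 1.0, and the value rises as more residue types are observed and as the most common residue becomes less dominant. In the weighted scheme described in the text, each value would additionally be scaled according to how likely the observed substitutions are to affect antibody binding.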



Computer Generated Secondary Structure
Predictions

The α-helix, β-sheet and β-turn secondary
structure probabilities for the amino-terminal region
(384-420) were determined using an algorithm which
assigns the probabilities for each of the three above
secondary structural motifs to each residue. The
coefficients used in the algorithm were obtained for all
pair-wise combinations of residues of the structural data
base. Levitt and Greer, (1977) J. Mol. Biol.
114:181-293. The prediction parameters obtained from
these coefficients were fitted to the observed outcome
when the algorithm was applied back on the database to
obtain probabilities that a given residue would be found
in one of the three defined secondary structural motifs.

Example 1

Comparison of Secondary Structure and Amino
Acid Sequence Variation in the HCV E2/NS1 HV
and HIV-1 gp120 Domains

The amino acid sequences from fifteen HCV and
HIV-1 isolates were compared with respect to the number
of positions at which amino acid sequence heterogeneities
were observed in the HCV E2 HV or HIV-1 gp120 V3 domains
(Figure 4, A and B, respectively). Amino acid
heterogeneities occurred in 25 of 30 amino acid positions
in the E2 HV region and 23 of 35 amino acid positions in
the HIV-1 gp120 V3 domain. Dashes on the x-axis of
Figure 4 A and B represent amino acid positions where
variable amino acid residues occur, and invariant amino
acids are given in the single letter amino acid code.
The antigenicity profiles shown in Figure 4 indicate
that, similar to the V3 loop of the HIV-1 gp120 protein
(Figure 4B), a block of amino acid residues in the HCV E2
(amino acids 384-414 in Figure 4A) was identified whose
variation had a predicted adverse effect on antibody
binding. The data in Figure 4 indicate that the HCV E2
domain resembles the HIV-1 gp120 V3 domain, which is
known to encode virus neutralizing epitopes, in both the
degree and predicted significance of observed amino acid
variation, and suggest that the E2 HV domain may have a
similar function to the gp120 V3 domain.

Linear epitopes are more likely associated with
less structured regions of proteins, in particular, the
ends of proteins or with extended surface loops. A
computer analysis was used to predict the probability
that an individual residue is associated with a defined
secondary structural motif for 15 E2 HV amino acid
sequences between residues 384 to 420. Figure 4 shows
that the region between the E2 amino-terminal residue 384
and the strongly predicted, highly conserved beta-turn
(residues 415-418) is relatively unstructured as
indicated by less than 50 percent probability of
alpha-helix, beta-sheet or beta-turn character. Lack of
strongly predictive structure in the E2 HV domain is
consistent with the tolerance for extensive sequence
variation found between isolates and is in contrast with
highly structured regions which contribute to tertiary
folding of the protein. The HCV E2 HV domain appears to
be even less structured than the V3, principal
neutralizing domain of HIV-1 gp120, which has been
reported to contain a beta strand-type II beta turn-beta
strand-alpha helix motif and may have greater structural
constraints on amino acid variability than the HCV E2 HV
domain. Taken together, the evidence suggests that the
E2 HV domain appears to have features characteristic of
protein domains which contain likely sites of linear
neutralizing epitopes.
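
The "less than 50 percent probability" criterion used above can be expressed as a short Python sketch. The per-residue probabilities below are invented placeholders, since the predicted values themselves are only shown graphically in the figures; the function name and the tuple layout (alpha-helix, beta-sheet, beta-turn) are likewise assumptions made for illustration.

```python
def unstructured_positions(probabilities, first_residue=384, threshold=0.5):
    """List residue numbers with no strongly predicted secondary structure.

    `probabilities` holds one (helix, sheet, turn) tuple per residue.
    A residue is reported when every one of the three probabilities
    is below `threshold`.
    """
    return [
        first_residue + offset
        for offset, residue_probs in enumerate(probabilities)
        if max(residue_probs) < threshold
    ]


# Hypothetical probabilities for three consecutive residues.
toy_probabilities = [
    (0.20, 0.30, 0.10),  # no motif reaches 0.5: treated as unstructured
    (0.10, 0.45, 0.20),  # no motif reaches 0.5: treated as unstructured
    (0.05, 0.10, 0.90),  # strongly predicted beta-turn
]
flagged = unstructured_positions(toy_probabilities)
```

With these placeholder values the first two residues are flagged and the third, which has a strongly predicted beta-turn, is not.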

Example 2

Epitope Mapping of the HCV E2/NS1 HV Domain

Overlapping biotinylated 8-mer peptides
corresponding to and extending past the E2/NS1 HV domain
(amino acids 384 to 416) of HCT 18 (A,D), Th (B,E) and
HCV J1 (C,F) were bound to plates coated with
streptavidin and reacted with plasma from either HCT 18
(A-C) or Th (D-F). The results are shown in Figure 6 for
HCV isolates HCT 18 (Fig. 6A and 6D), Th (Fig. 6B and
6E), and HCV J1 (Fig. 6C and 6F). HCT 18 plasma was
diluted 1:200 and Th plasma was diluted 1:500. HVE-1,
-2, -3, -4 and -5 represent isolate specific epitopes.
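
A minimal Python sketch of how a set of overlapping 8-mers can be enumerated from a parent sequence is given below. It assumes that consecutive peptides are offset by one residue, which is not stated explicitly in the text, and it uses the 12-residue Q1 sequence from Table 4 (amino acids 396 to 407) as input only because the full HCT 18, Th and HCV J1 sequences are not reproduced in this section. The function name is hypothetical.

```python
def overlapping_peptides(sequence, length=8, step=1, first_residue=1):
    """Return (start_residue, peptide) pairs tiling `sequence`.

    Each peptide is `length` residues long and starts `step` residues
    after the previous one; `first_residue` is the residue number of
    the first position in `sequence`.
    """
    return [
        (first_residue + i, sequence[i:i + length])
        for i in range(0, len(sequence) - length + 1, step)
    ]


# Q1 12-mer from Table 4, numbered from amino acid 396.
peptides = overlapping_peptides("TARFAGFFQSGA", first_residue=396)
```

A parent sequence of n residues yields n - 7 octapeptides at a one-residue offset, so the 12-mer above gives five peptides and a 33-residue region such as amino acids 384 to 416 would give 26.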

As seen from Figure 6, HCT 18 plasma identified
a linear epitope (^PKQNV 411) when tested with
peptides derived from the HCT 18 sequence (HVE-I in
Figure 6A), but failed to react with peptides corresponding
to the HV domain of two different strains, Th and
HCV J1 (Figures 6B and 6C). In contrast, Th plasma
identified linear epitope HVE-IV in the HV domain of Th
(^QNIQLI 41*, Figure 6E), and also epitopes in strain HCT
18 (^IVRFFAP 405, Figure 6D) and HCV J1. Th, an IV drug
user, may have been exposed to multiple strains of HCV.

Both Th and HCT 18 plasma reacted with an
epitope (amino acids 413-419) common to all three
isolates (data not shown) when used in an ELISA with pin
synthesized overlapping 8-mer peptides from each isolate.

In order to validate antibody binding
specificity, antibodies bound to biotinylated peptides
containing amino acids 403-407 were eluted and used to
block the reactivity of HCT 18 plasma with pins
containing overlapping 8-mers for the HCT 18 HV domain.
These data indicate that 1) the E2/NS1 HV domain is
immunogenic, 2) there are multiple epitopes which map to
this region, and 3) a subset of epitopes (HVE-1, -2, -3,
-4 or -5 in Figure 6) in the HV domain are isolate
specific.

Example 3

Determination that Variant E2/NS1 HV Domains
Can Be Associated With Flares of Hepatitis

To investigate the possibility of finding HCV
variants associated with the intermittent flares of
hepatitis often found in chronic HCV infections, we
partially sequenced the E2/NS1 gene from a patient, Q,
with chronic hepatitis during two distinct episodes of
hepatitis approximately two years apart (Q1 and Q3,
respectively). The second episode of hepatitis occurred
1.5 years after the termination of interferon treatment.

The deduced amino acid sequences of the Q1 and Q3
E2/NS1 HV regions were strikingly
different only between amino acids 391-408, with seven of
eight changes occurring between amino acids 398 and 407
(Figure 7). Figure 7 shows the deduced amino acid
sequences of two regions of the E2/NS1 polypeptide, amino
acids 384-414 and 547-647, for the Q1 and Q3 isolates.
The amino acid (E) above the Q1 sequence was found in one
of four Q1 clones. The boxed amino acids represent the
location of the Q1 or Q3 HVE 12-mer peptide. Amino acid
sequence differences found between Q1 and Q3 are printed
in bold type.

Only one amino acid heterogeneity was observed
between amino acids 547 and 647 of the Q1 and Q3 E2/NS1
polypeptides (Figure 7).

To examine the effect of the amino acid
substitutions observed in the Q1 and Q3 E2 HV domains on
antibody binding, we synthesized a Q1 and Q3 specific
12-mer peptide from amino acids 396 to 407 (HVE Q1 or Q3
in Figure 7B) and separately reacted the Q1 and Q3 plasma
with each peptide in an ELISA. Table 4 shows that
antibodies in both the Q1 and Q3 plasma reacted with the
Q1 peptide but not with the Q3 peptide. Statistical
analysis (Student's t-test) indicated that the binding of
the Q1/Q3 plasma to the Q1 peptide was significantly
above background binding of those plasma to a panel of 12
randomly chosen control peptides (P<0.001), while binding
of either the Q1 or Q3 plasma to the Q3 peptide was not
statistically significant. The data indicate that
although patient Q developed antibodies to the HCV Q1 HV
domain, which were still detectable two years later at
the Q3 time point, no detectable humoral response had
developed to the Q3 E2 HV variant which was predominant
during the second episode of hepatitis.

Table 4

ELISA Results on 12-mer Peptides

            TARFAGFFQSGA         TAGFVRLFETGP
            Q1 seq               Q3 seq
Plasma      Mean     sd          Mean     sd
Q1          1.158    0.134       0.691    0.123
Q3          1.022    0.123       0.693    0.036
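
The residue differences between the two 12-mer peptides in Table 4 can be listed with a short Python sketch such as the one below. The peptide sequences are those given in the table, and the numbering assumes, as stated in the text, that the peptides correspond to amino acids 396 to 407; the function name is hypothetical.

```python
Q1_PEPTIDE = "TARFAGFFQSGA"
Q3_PEPTIDE = "TAGFVRLFETGP"


def residue_differences(seq_a, seq_b, first_residue):
    """Return (position, residue_a, residue_b) for each mismatch."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be the same length")
    return [
        (first_residue + i, a, b)
        for i, (a, b) in enumerate(zip(seq_a, seq_b))
        if a != b
    ]


# Peptides span amino acids 396 to 407 of the E2/NS1 HV domain.
differences = residue_differences(Q1_PEPTIDE, Q3_PEPTIDE, first_residue=396)
changed_positions = [position for position, _, _ in differences]
```

The comparison gives seven substitutions, all located between amino acids 398 and 407, which agrees with the description of the Q1 and Q3 sequences given in this Example.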






Example 4

Detection of Coexisting E2/NS1 Genes With
Distinct E2/NS1 HV Domains in HCV Infected
Individuals

Figure 8A shows the amino acid sequences
deduced from two isolates of HCV J1 (J1.1 & J1.2) which
were cloned from one plasma sample of the Japanese
volunteer blood donor HCV J1. Kubo et al., (1989) Nucl.
Acids Res. 12:10367-10372. Of the 23 total amino acid
changes between HCV J1.1 and HCV J1.2, 9 differences
indicated by bold type are clustered in the 30 amino acid
E2/NS1 HV domain. Five of the 9 amino acid substitutions
in the E2/NS1 HV domain represent nonconservative amino
acid changes. Since HCV J1 is the only group II HCV
genome which has been cloned in our laboratory, it is
unlikely that these differences are due to cross
contamination of the HCV J1 plasma. The HCV J1.2 sequence
represents a minority sequence in HCV J1's blood since
only two E2/NS1 HV variant sequences were identified from
7 cloned sequences which originated from two independent
PCR reactions.

Interestingly, a comparison of the HCT27 and
HCV E1 isolates (Figure 8B), which were sequenced in
different laboratories and derive from presumably
unrelated individuals, showed that the number of amino
acid differences in the E2/NS1 HV domain of these
isolates was smaller than the number of differences
observed between isolates from the same individual.

The above described results lead to the
suggestion that the HCV genome is rapidly evolving in
individuals and the population.

Example 5

Formulation and Preparation of Vaccine

Coupling of the Diphtheria Toxoid Carrier Protein to MCS
Materials Required

ethylene diamine tetra-acetic acid (EDTA Na2.2H2O) (MW 372)
6-maleimido-caproic acid N-hydroxysuccinimide ester (MCS)
(Sigma) - 95% pure
sodium dihydrogen orthophosphate (NaH2PO4)
nitrogen
dimethylformamide (DMF)
0.1 M phosphate buffer containing 5 mM EDTA, pH 6.66
0.1 M phosphate buffer, pH 8.0
0.1 M phosphate buffer, pH 7.0
sodium succinate [(CH2COONa)2.6H2O]
cysteine
hydrochloric acid (2% solution)
0.1 M sodium succinate/0.1 EDTA, pH 5.6

Purified diphtheria toxoid (Commonwealth Serum
Laboratories, Victoria, Australia) was coupled to MCS
according to the method described by Lee et al., (1980)
Mol. Immunol. 17:749; Partis et al., (1983) Prot. Chem.
2:263; Peeters et al., (1989) J. Immunol. Methods
120:133; Jones et al., (1989) J. Immunol. Methods
123:211. 100 ml of diphtheria toxoid was passed through
a G25 Sephadex column (17 cm x 4 cm) to remove thiomersal.
The toxoid was eluted with 0.1 M phosphate buffer, pH 7.0,
and the protein content of the eluate was assayed using
the BCA protein determination (Pierce). The resulting
solution was concentrated using an Amicon ultrafiltration
unit to a final concentration of 10 mg/ml.

One milliliter of the toxoid solution was
dialyzed with 0.1 M phosphate buffer, pH 8.0, and then
mixed with a solution of 1.5 mg MCS in 200 µl DMF. The
resulting solution was incubated at room temperature for
1 hour in the dark with occasional mixing. In order to
separate the uncoupled MCS from the MCS-toxoid, the
solution was passed through a Sephadex PD10 column which
had been equilibrated with 0.1 M phosphate buffer, pH
6.66, and the protein fraction was collected.

The number of maleimido groups coupled per
carrier molecule was determined prior to coupling of the
HCV peptides thereto. Thirty milliliters of the
succinate/EDTA buffer was sparged with nitrogen for 2
minutes. Five milligrams of cysteine was transferred
into a 25 ml volumetric flask and dissolved in a final
volume of 25 ml of the sparged buffer. Aliquots of the
solutions shown in Table 5 were transferred in duplicate
to 25 ml screw capped bottles. Using separate pipettes,
nitrogen was bubbled into each aliquot. Each bottle was
then sealed and incubated at room temperature in the dark
for 40 minutes with occasional swirling.

Table 5

Solution             Sample (ml)    Standard (ml)    Blank (ml)
activated carrier    0.3            -                -
phosphate buffer     -              0.3              0.3
cysteine solution    1.0            1.0              -
succinate buffer     -              -                1.0

* A 0.1 ml aliquot of each of the 3 solutions was taken
for an Ellman's determination.

Ellman's Test for the Quantitative Determination of
Sulfhydryl
Materials Required 

Phosphate buffer, pH 8.0

Dissolve 15.6 g NaH2PO4 or 12.0 g
NaH2PO4 anhydrous in approximately 700
ml Milli Q water. Adjust the pH to
8.0 using 50% NaOH. Add Milli Q water
for a final volume of 1000 ml and
then adjust the pH if necessary.

Ellman's Reagent

Dissolve 10.0 mg of 5,5'-dithiobis-2-nitrobenzoic
acid (DTNB) in 2.5 ml of phosphate
buffer, pH 8.0.

0.1 ml of Ellman's reagent was added to each of 
the 0.1 ml aliquots of the solutions prepared above, 
namely the sample, standard and blank solutions. Five
milliliters of phosphate buffer, pH 8.0, was then added 
to each aliquot, mixed well and allowed to stand for 15 
minutes. The absorbance of each aliquot was measured in a 
1 cm path length cell at 412 nm. 

The number of maleimido groups present on the
carrier protein was determined according to the following
method. A 0.01 µmol per ml solution of -SH produces an
absorbance of 0.136 in a 1 cm light path at 412 nm. The
absorbance of the Standard or Sample (A) is equal to the
amount of cysteine reacted with the coupled maleimido
groups on the activated carrier protein. Since 1 mol of
available -SH reacts with 1 mol of maleimido, the
concentration in µmols of the maleimido groups present in
the aliquot tested is equal to A(0.01)/0.136 µmol/ml.
The total volume of the solution was 5.2 ml. Therefore,
the total number of µmols present was equal to
A(0.01)(5.2)/0.136. The sample solution had a total
volume of 1.3 ml, of which 0.3 ml consisted of the
activated carrier protein. The amount of maleimido
groups present in the sample solution was calculated as
A(0.01)(5.2)(1.3)/[(0.136)(0.1)(0.3)] = A(16.57) µmol/ml.
The MCS-activated carrier protein was stored at -20°C.
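
The arithmetic of the maleimido determination can be sketched in a few lines of Python. This is only an illustration of the calculation as stated in the text. The function and constant names are hypothetical, and the input A is taken to be the absorbance value defined above.

```python
# Illustrative sketch of the maleimido calculation described in the text.
# All names are hypothetical; the numeric constants come from the text.

ABS_OF_0_01_UMOL_PER_ML = 0.136  # absorbance of 0.01 umol/ml -SH, 1 cm path, 412 nm
ASSAY_VOLUME_ML = 5.2            # 0.1 ml aliquot + 0.1 ml Ellman's reagent + 5 ml buffer
SAMPLE_VOLUME_ML = 1.3           # 0.3 ml activated carrier + 1.0 ml cysteine solution
ALIQUOT_VOLUME_ML = 0.1          # volume taken for the Ellman's determination
CARRIER_VOLUME_ML = 0.3          # activated carrier present in the sample solution


def sh_umol_per_ml(absorbance: float) -> float:
    """Concentration of -SH (umol/ml) in the measured assay solution."""
    return absorbance * 0.01 / ABS_OF_0_01_UMOL_PER_ML


def maleimido_umol_per_ml_carrier(absorbance: float) -> float:
    """Maleimido groups (umol) per ml of activated carrier solution."""
    umol_in_assay = sh_umol_per_ml(absorbance) * ASSAY_VOLUME_ML
    umol_in_sample = umol_in_assay * SAMPLE_VOLUME_ML / ALIQUOT_VOLUME_ML
    return umol_in_sample / CARRIER_VOLUME_ML


if __name__ == "__main__":
    # For A = 1 the result is the multiplier quoted in the text (about 16.57).
    print(round(maleimido_umol_per_ml_carrier(1.0), 2))
```

Keeping each volume as a named constant makes it easy to see where the factor of 16.57 comes from and to adapt the calculation if a different aliquot or dilution is used.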

Reduction of the HCV Peptides

Prior to coupling of the HCV peptides to the
MCS-activated carrier protein, the peptides were reduced
to ensure that thiol groups present on the peptides were
in the fully reduced -SH form.

Materials Required

dithiothreitol (DTT)
ammonium hydrogen carbonate (NH4HCO3)
methanol
SEP-PAKs (C18 cartridge, Waters), 1 cartridge for each 8
mg of peptide
0.1 M ammonium hydrogen carbonate buffer
    Dissolve 7.9 g NH4HCO3 in 1 L Milli Q water
Buffer A, 0.1% v/v trifluoroacetic acid (TFA) in Milli Q
water
Buffer B, 60% v/v acetonitrile, 0.1% v/v TFA in Milli Q
water

15 mg of each of two HCV peptides corresponding
to amino acids 384-411 and 225-260, respectively, of the
HCV polyprotein were added to 2.5 ml of 0.1 M ammonium
hydrogen carbonate containing a 10 fold molar excess of
DTT. The resulting solutions were mixed until the
peptide had dissolved and were then allowed to stand for
1 hour at room temperature. Two pairs of SEP-PAKs were
connected in series and activated by passing
approximately 20 ml of methanol and then 20 ml of Buffer
A through each pair of SEP-PAKs. Each peptide/DTT sample
was slowly passed through a pair of SEP-PAKs. The DTT
was eluted with 20 ml of Buffer A. The reduced peptide
was eluted with 7 ml of Buffer B into a pre-weighed
bottle and then freeze-dried overnight. The bottles were
then weighed to determine the amount of recovered
peptide. The reduced peptides were then immediately
coupled to the MCS-activated carrier protein.

Coupling HCV Peptides to MCS-Activated Carrier Protein

Approximately 100 ml of 0.1 M phosphate buffer
with 5 mM EDTA, pH 6.66, was degassed under vacuum and
then sparged with nitrogen for 10 minutes. Twenty
milliliters of a 10 mg/ml solution of the MCS-activated
carrier protein was carefully sparged with nitrogen to
prevent excessive frothing. 5 mg of each of the reduced
peptides were dissolved in approximately 0.2 ml of the
degassed sparged phosphate/EDTA buffer, pH 6.66, and then
mixed with the MCS-activated carrier protein solution.
The resulting mixture was transferred into a screw capped
bottle which was then filled with nitrogen and sealed.
The solution was further degassed by holding the bottle
in a Branson 2000® sonication bath for 2 minutes. The
bottle was covered with aluminum foil and incubated
overnight at room temperature with slow mixing on a
shaker table.

The resultant conjugate was soluble and the
uncoupled peptide was removed by passing the mixture over
a Sephadex PD10 column which had been equilibrated with
the phosphate/EDTA buffer, pH 6.66. The protein fraction
was collected. The amount of peptide conjugated to the
carrier protein was determined by amino acid analysis.

An amino acid analysis of 150 µl aliquots of
both the conjugate and the carrier protein was performed.
The average ratio of the level of amino acids contributed
solely by the carrier protein was determined to calculate
the amount of conjugated peptide produced. Levels of
serine, threonine, tryptophan, methionine, tyrosine and
cysteine were not determined as these amino acids are
modified under the standard hydrolysis conditions.
Typical results obtained in these calculations are
presented in Table 6.

Table 6

AMINO ACID    CARRIER ONLY    CONJUGATE
D             212             193
E             194             179
G             153             108
R             60              56
A             150             384
P             79              163

For the conjugate, the values in bold type are
the amino acids that were also present in the peptides.
For conjugates containing alanine and proline, the factor
(193+179+108+56)/(212+194+153+60) = 0.8659 is multiplied
by the amino acid level in order to
normalize the result.
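
The normalization factor can be reproduced with a short Python sketch. It assumes the conjugate levels for D, E, G and R that yield the stated value of 0.8659 (193, 179, 108 and 56), and the variable and function names are hypothetical. The helper that applies the factor to a level simply mirrors the multiplication described in the text and is not a full account of the amino acid analysis.

```python
# Illustrative sketch: normalization factor from amino acids contributed
# solely by the carrier protein (D, E, G, R). Names are hypothetical.

carrier_only = {"D": 212, "E": 194, "G": 153, "R": 60}
conjugate = {"D": 193, "E": 179, "G": 108, "R": 56}


def normalization_factor(conj: dict, carrier: dict) -> float:
    """Ratio of summed conjugate levels to summed carrier-only levels."""
    return sum(conj.values()) / sum(carrier.values())


def normalize(level: float, factor: float) -> float:
    """Multiply an amino acid level by the factor, as described in the text."""
    return level * factor


factor = normalization_factor(conjugate, carrier_only)

if __name__ == "__main__":
    print(round(factor, 4))  # 536 / 619, about 0.8659
```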



Preparation of Vaccine Composition

Injectable compositions were prepared consisting of HCV
peptides conjugated to MCS-activated diphtheria toxoid
carrier protein prepared as described supra and a
submicron oil-in-water emulsion adjuvant as described in
PCT International Publication No. WO90/14837, published
December 13, 1990, which is incorporated by reference
herein. In addition, injectable compositions containing
an immunostimulant, lipophilic muramyl peptide (MTP-PE,
CIBA-GEIGY, Basel, Switzerland), in addition to HCV
conjugated peptides and adjuvant were prepared. The
vaccine compositions were generally comprised of 50%
protein and 50% adjuvant.



Formula for Vaccine Composition with MTP-PE
To prepare 10 ml of injectable vaccine composition:

2.5 ml Squalene (Sigma Chemical Co., St. Louis, Mo.)
0.25 ml Tween 80 (Sigma Chemical Co.)
0.25 ml SPAN 85 (Sigma Chemical Co.)
1000 µg MTP-PE
1000 µg HCV peptide conjugated to MCS-activated
diphtheria toxoid carrier protein

Formula for Vaccine Composition without MTP-PE

To prepare 10 ml of injectable vaccine composition:

2.5 ml Squalene (Sigma Chemical Co., St. Louis, Mo.)
0.25 ml Tween 80 (Sigma Chemical Co.)
0.25 ml SPAN 85 (Sigma Chemical Co.)
1000 µg HCV peptide conjugated to MCS-activated
diphtheria toxoid carrier protein
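
Where a batch size other than 10 ml is wanted, the oil and surfactant volumes can be scaled proportionally. The sketch below assumes simple linear scaling, which the text does not state explicitly, and the names used are hypothetical. Only the three volumetric components are included. The mass components would scale by the same ratio under the same assumption.

```python
# Illustrative sketch: linear scaling of the per-10-ml formula.
# Linear scaling is an assumption; all names are hypothetical.

RECIPE_PER_10_ML = {
    "Squalene (ml)": 2.5,
    "Tween 80 (ml)": 0.25,
    "SPAN 85 (ml)": 0.25,
}


def scale_recipe(recipe: dict, target_ml: float, base_ml: float = 10.0) -> dict:
    """Return component amounts scaled from base_ml to target_ml."""
    if target_ml <= 0 or base_ml <= 0:
        raise ValueError("volumes must be positive")
    return {name: amount * target_ml / base_ml for name, amount in recipe.items()}


if __name__ == "__main__":
    for name, amount in scale_recipe(RECIPE_PER_10_ML, 50.0).items():
        print(f"{name}: {amount}")
```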



Example 6
Method for Testing Vaccine
Preparations for Toxicity

Vaccine prepared according to the methodology
of Example 5 was tested for toxicity in small animals.
Fifty micrograms per kilogram of vaccine was administered
to guinea pigs, mice and rabbits by intraperitoneal
injection. The vaccine was also administered by
intraperitoneal injection to rhesus monkeys and primates.
Half of the test population of rhesus monkeys and
primates received 5 µg/kg doses of the vaccine, while the
other half received 50 µg/kg dosages. Control animals
employed in each of the studies were injected with a
comparable amount of a composition consisting of the
components of the vaccine preparation except the viral
peptides.

Each of the animals was monitored for symptoms
indicative of a response to toxic material. More
specifically, each animal in the study was examined bi-weekly
for symptoms including fever, lethargy, weight
loss, changes in eating habits and for lesions, swelling
or tenderness at the site of injection. Lymph nodes
proximal to the injection site were also examined for
swelling and/or drainage. The animals were monitored on
a bi-weekly basis for a period of several months.

Example 7

Demonstration of the Production of 
Neutralizing Antibody in Vaccinated Animals 



Vaccine prepared according to the methodology
of Example 5 was tested in chimpanzees in order to
determine the effectiveness of the vaccine in eliciting
the production of virus neutralizing antibody in
vaccinated subjects. Chimpanzees were vaccinated with 5
µg/kg dosages of vaccine prepared according to the
methodology of Example 5 over a six-month time period at
intervals of 0, 1, 3 and 6 months. Control chimpanzees
were injected with comparable amounts of a composition
consisting of the components of the vaccine except the
viral peptides. Two weeks after the last dose of vaccine
was administered, the test and control chimpanzees were
each challenged with a 10 CIU (Chimpanzee Infectious
Unit) dose of CDC/910 plasma inoculum. Commencing one
week following the viral challenge, each of the
chimpanzees was monitored for viremia on a weekly basis.

In order to detect viremia, blood samples and
liver biopsy specimens were collected from control and
test animals on a weekly basis for several months.
Tissue collected by liver biopsy was examined
histologically for signs of necrosis and/or inflammation.
In addition, hepatocytes from the biopsy material were
examined by electron microscopy for the presence of
tubules characteristic of HCV infection. The blood
samples were also analyzed by the ELISA assay described
supra for the presence of antibodies to segments of viral
polypeptides which were not utilized in preparing the
vaccine. In particular, each of the blood samples was
screened by ELISA for the presence of antibodies to NS3,
NS4 and NS5 peptides. The presence of antibodies to
these peptides in the serum of a chimpanzee was
indicative of HCV infection.

The following method was employed to detect
viral RNA circulating in plasma or present in liver
biopsy tissue collected from the chimpanzees.

cPCR Method to Detect HCV RNA in Liver and in Serum

In the cPCR assay, putative viral RNA in the
sample is reverse transcribed into cDNA with reverse
transcriptase; a segment of the resulting cDNA is then
amplified utilizing a modified version of the PCR
technique described by Saiki et al. (1986). The primers
for the cPCR technique are derived from HCV RNA, which
can be identified by the family of HCV cDNAs provided
herein. Amplified product corresponding to the HCV RNA
is detected utilizing a probe derived from the family of
HCV cDNAs provided herein.

The cPCR/HCV assay used in these studies was
performed utilizing the following methods for the
preparation of RNA, the reverse transcription of the RNA
into cDNA, the amplification of specific segments of the
cDNA by PCR, and the analysis of the PCR products.

RNA was extracted from liver utilizing the
guanidinium isothiocyanate method for preparing total RNA
described in Maniatis et al. (1982).

In order to isolate total RNA from plasma, the
plasma was diluted five- to ten-fold with TENB (0.1 M
NaCl, 50 mM Tris-HCl, pH 8.0, 1 mM EDTA) and incubated in
a Proteinase K/SDS solution (0.5% SDS, 1 mg/ml Proteinase
K, 20 micrograms/ml Poly A carrier) for 60 to 90 minutes
at 37°C. The samples were extracted once with phenol (pH
6.5), the resulting organic phase was re-extracted once
with TENB containing 0.1% SDS, and the aqueous phases of
both extractions were pooled and extracted twice with an
equal volume of phenol/CHCl3/isoamyl alcohol [1:1(99:1)].
The resulting aqueous phases were extracted with an equal
volume of CHCl3/isoamyl alcohol (99:1) twice, and ethanol
precipitated using 0.2 M sodium acetate, pH 6.5, and 2.5
volumes of 100% ethanol; precipitation was overnight at
-20°C.



The cDNA used as a template for the PCR reaction
was prepared utilizing the designated samples for
preparation of the corresponding cDNAs. Each RNA sample
(containing either 2 micrograms of heat denatured total
chimpanzee liver RNA or RNA from 2 microliters of plasma)
was incubated in a 25 microliter reaction containing 1
micromolar of each primer, 1 millimolar of each
deoxyribonucleotide triphosphate (dNTP), 50 millimolar
Tris-HCl, pH 8.3, 5 millimolar MgCl2, 5 millimolar
dithiothreitol (DTT), 73 millimolar KCl, 40 units of
RNase inhibitor (RNASIN), and 5 units of AMV reverse
transcriptase. The incubation was for 60 minutes at
37°C. Following cDNA synthesis, the reactions were
diluted with 50 microliters of deionized water (DIW),
boiled for 10 minutes, and cooled on ice.

Amplification of a segment of the HCV cDNA was
performed utilizing two synthetic oligomer 16-mer primers
whose sequences were derived from HCV cDNA clones 36
(anti-sense) and 37b (sense). The sequence of the primer
from clone 36 was:

5' GCA TGT CAT GAT GTA T 3'.

The sequence of the primer from clone 37b was:

5' ACA ATA CGT GTG TCA C 3'.

The primers were used at a final concentration of 1
micromolar each. In order to amplify the segment of HCV
cDNA which is flanked by the primers, the cDNA samples
were incubated with 0.1 microgram of RNAse A and the PCR
reactants of the Perkin Elmer Cetus PCR kit (N801-0043 or
N801-0055) according to the manufacturer's directions.
The PCR reaction was performed for either 30 cycles or 60
cycles in a Perkin Elmer Cetus DNA thermal cycler. Each
cycle consisted of a 1 minute denaturation step at 94°C,
an annealing step of 2 minutes at 37°C, and an extension
step of 3 minutes at 72°C. However, the extension step
in the final cycle (30 or 60) was 7 minutes rather than 3
minutes. After amplification the samples were extracted
with an equal volume of phenol:chloroform (1:1),
followed by extraction with an equal volume of
chloroform, and then the samples were precipitated with
ethanol containing 0.2 M sodium acetate.
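
As a rough cross-check on the 37°C annealing step, the melting temperatures of the two 16-mer primers can be estimated with the Wallace rule (2°C per A or T, 4°C per G or C). The Wallace rule is a common rule of thumb for short oligonucleotides and is not part of the method described here. The constant and function names below are hypothetical, and the primer sequences are those given above with the spaces removed.

```python
# Illustrative sketch: Wallace-rule estimate of primer melting temperature.
# The rule itself is an outside approximation, not taken from the text.

PRIMER_CLONE_36 = "GCATGTCATGATGTAT"   # anti-sense primer, 5' to 3'
PRIMER_CLONE_37B = "ACAATACGTGTGTCAC"  # sense primer, 5' to 3'


def wallace_tm(sequence: str) -> int:
    """Estimate Tm in degrees C as 2*(A+T) + 4*(G+C)."""
    seq = sequence.upper()
    at_count = seq.count("A") + seq.count("T")
    gc_count = seq.count("G") + seq.count("C")
    if at_count + gc_count != len(seq):
        raise ValueError("sequence contains characters other than A, C, G, T")
    return 2 * at_count + 4 * gc_count


if __name__ == "__main__":
    for name, primer in (("clone 36", PRIMER_CLONE_36), ("clone 37b", PRIMER_CLONE_37B)):
        print(name, len(primer), wallace_tm(primer))
```

Both estimates come out a few degrees above the 37°C annealing temperature, which is consistent with a low-stringency annealing step for short primers.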

The cPCR products were analyzed as follows.
The products were subjected to electrophoresis on 1.8%
alkaline agarose gels according to Murakawa et al.
(1988), and transferred onto Zeta Probe paper (BioRad
Corp.) by blotting gels overnight in 0.4 M NaOH. The
blots were neutralized in 2 X SSC (1 X SSC contains 0.15
M NaCl, 0.015 M sodium citrate), prehybridized in 0.3 M
NaCl, 15 mM sodium phosphate buffer, pH 6.8, 15 mM EDTA,
1.0% SDS, 0.5% nonfat milk (Carnation Co.), and 0.5 mg/ml
sonicated denatured salmon sperm DNA. The blots to be
analyzed for HCV cDNA fragments were hybridized to a
32P-labeled probe generated by nick translation of the
HCV cDNA insert sequence in clone 35, described in
U.S. S.N. 07/456,637. After hybridization, the blots were
washed in 0.1 X SSC (1 X SSC contains 0.15M NaCl, 0.01M
Na citrate) at 65°C, dried, and autoradiographed. The
expected product size is 586 nucleotides in length;
products which hybridized with the probe and migrated in
the gels in this size range were scored as positive for
viral RNA.

As a control, cPCR using primers designed to amplify
alpha-1 antitrypsin mRNA was performed to verify the
presence of RNA in each sample analyzed. The coding
region of the alpha-1 antitrypsin gene is described in
Rosenberg et al. (1984). Synthetic oligomer 16-mer primers
designed to amplify a 365 nucleotide fragment of the
coding region of the alpha-1 antitrypsin gene were
derived from nucleotides 22-37 (sense) and nucleotides
372-387 (antisense). The PCR products were detected
using a 32P nick-translated probe which lies between, and
does not include, the cDNA/PCR primer sequences.

Due to the extreme sensitivity of the PCR reaction,
all samples were run a minimum of three times.
All false positive signals were eliminated when the following
precautions were taken: 1) eliminating aerosols by
using screw capped tubes with rubber O-ring seals; 2)
pipetting with Rainin MICROMAN positive displacement
pipetters with disposable pistons/capillaries; and 3)
selecting the oligonucleotide sequences for the cDNA and
PCR primers from two non-contiguous cDNA clones.

Industrial Utility 

The immunoreactive compositions of the
invention have utility in the preparation of materials,
for example, vaccines, which in turn may be used for the
treatment of individuals against HCV infections,
particularly chronic HCV infections. In addition, the
compositions may be used to prepare materials for the
detection of multiple variants of HCV in biological
samples. For example, the immunoreactive compositions of
the present invention can be used to generate polyclonal
antibody compositions that recognize more than one HCV
isolate, or as the antigen in an anti-HCV antibody
immunoassay. The latter method can be used to screen
blood products for possible HCV contamination.
Polyclonal antiserum or antibodies can be used for
passive immunization of an individual.

Claims

WHAT IS CLAIMED IS:
1. An immunoreactive composition comprising
polypeptides wherein the polypeptides comprise the amino
acid sequence of an epitope within a first variable
domain of a hepatitis C virus (HCV), and at least two
heterogeneous amino acid sequences from the first
variable domain of distinct HCV isolates are present in
the composition.

2. An immunoreactive composition according to
claim 1 comprising a plurality of antigen sets, wherein
(a) each antigen set consists of a plurality of
substantially identical polypeptides comprising the amino
acid sequence of an epitope within a first variable
domain of an HCV isolate, and (b) the amino acid sequence
of the epitope of one set is heterogeneous with respect
to the amino acid sequence of the analogous sequence of
at least one other set.

3. An immunoreactive composition according to
claim 1 wherein the first heterogeneous amino acid
sequence is from an HCV group I isolate and the second
heterogeneous amino acid sequence is from an HCV group II
isolate.

4. An immunoreactive composition according to
claim 1 wherein the variable domain is within the E2/NS1
protein.

5. An immunoreactive composition according to
claim 4 wherein the variable domain is encoded from about
amino acid 384 to about amino acid 411 of the HCV
polyprotein.

6. An immunoreactive composition according to
claim 1 wherein the variable domain is within the E1
protein.

7. An immunoreactive composition according to
claim 6 wherein the variable domain is encoded from about
amino acid 225 to about amino acid 260 of the HCV
polyprotein.

8. An immunoreactive composition according to
claim 1 wherein the polypeptides further comprise the
amino acid sequence of an epitope within a second
variable domain of a hepatitis C virus (HCV), and at
least two heterogeneous amino acid sequences from the
second variable domain of distinct HCV isolates are
present in the composition.

9. An immunoreactive composition according to
claim 8 wherein the first variable domain is within the
E2/NS1 protein and the second variable domain is within
the E1 protein.

10. An immunoreactive composition according to
claim 1 comprising a plurality of polypeptides wherein
each polypeptide has the formula

R_r-(SV_n)_x-R'_r'

wherein
R and R' are amino acid sequences of about
1-2000 amino acids, and are the same or different;
r and r' are 0 or 1, and are the same or
different;
V is an amino acid sequence comprising the
sequence of an HCV variable domain, wherein the variable
domain comprises at least one epitope;
S is an integer ≥ 1, representing a selected
variable domain; and
n is an integer ≥ 1, representing a selected
HCV isolate heterogeneous at a given SV with respect to
at least one other isolate having a different value for
n, and n being independently selected for each x;
x is an integer ≥ 1; and
with the proviso that amino acid sequences are present in
the composition representing a combination selected from
the group consisting of (i) 1V_1 and 1V_2, (ii) 1V_1 and 2V_2,
and (iii) 1V_1 and 2V_1.

11. The immunoreactive composition according
to claim 10 wherein the polypeptide formula is
R_r-1V_1-1V_2-R'_r'.

12. The immunoreactive composition according
to claim 10 wherein the polypeptide composition comprises
a mixture of polypeptides of the formulae
R_r-1V_1-R'_r' and
R_r-1V_2-R'_r'.

13. A method for preparing an immunogenic
composition for treatment of HCV comprising:
(a) providing an immunogenic composition
according to claim 1;
(b) providing a suitable excipient; and
(c) mixing the immunogenic composition of (a)
with the excipient of (b).

14. A method for producing anti-HCV antibodies
comprising administering to a mammal an effective amount
of an immunoreactive composition according to claim 1.

15. A polyclonal antibody composition made
according to the method of claim 14.

16. A method of detecting antibodies to HCV
within a biological sample comprising:
(a) providing a biological sample suspected of
containing antibodies to multiple strains of HCV;
(b) providing an immunoreactive composition
according to claim 1;
(c) reacting the biological sample of (a) with
the immunoreactive composition of (b) under conditions
which allow the formation of antigen-antibody complexes;
and
(d) detecting the formation of complexes
formed between the antigen of (a) and the antibodies of
the biological sample of (b), if any.

17. A kit for detecting antibodies to multiple
strains of HCV within a biological sample comprising an
immunoreactive composition according to claim 1 packaged
in a suitable container.

18. A DNA molecule encoding a polypeptide
comprising two heterogeneous amino acid sequences from
the same variable domain of distinct HCV isolates.

19. A host cell comprising a DNA molecule 
according to claim 18. 
20. A host cell according to claim 19 wherein
the DNA molecule comprises control sequences that are
capable of causing the expression of the polypeptide.

21. A method of making a recombinant protein 
comprising growing a population of host cells according 
to claim 20 under conditions that provide for the 
expression of the polypeptide. 